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The present invention relates to polynucleotides, herein referred to as CASB641 1 
polynucleotides, polypeptides encoded thereby (referred to herein as CASB641 1 

5 polypeptides), recombinant materials and methods for their production. In another aspect, 
the invention relates to methods for using such polypeptides and polynucleotides, including 
the treatment of cancer and autoimmune diseases and other related conditions. In a further 
aspect, the invention relates to methods for identifying agonists and antagonists/inhibitors 
using the materials provided by the invention, and treating conditions associated with 

10 CASB641 1 polypeptide imbalance with the identified compounds. In a still further 
aspect, the invention relates to diagnostic assays for detecting diseases associated with 
inappropriate CASB641 1 polypeptide activity or levels. 

Polypeptides and polynucleotides of the present invention are believed to be important 
15 immunogens for specific prophylactic or therapeutic immunization against tumours, because 
they are specifically expressed or highly over-expressed in tumours compared to normal 
cells and can thus be targeted by antigen-specific immune mechanisms leading to the 
destruction of the tumour cell. They can also be used to diagnose the occurrence of tumour 
cells. Furthermore, their inappropriate expression in certain circumstances can cause an 
20 induction of autoimmune, inappropriate immune responses, which could be corrected 
through appropriate vaccination using the same polypeptides or polynucleotides. In this 
respect the most important biological activities to our purpose are the antigenic and 
immunogenic activities of the polypeptide of the present invention. A polypeptide of the 
present invention may also exhibit at least one other biological activity of a CASB641 1 
25 polypeptide, which could qualify it as a target for therapeutic or prophylactic intervention 
different firom that linked to the unmune response. 

In a first aspect, the present invention relates to CASB641 1 polypeptides. Such peptides 
include isolated polypeptides comprising an amino acid sequence which has at least 70% 
30 identity, preferably at least 80% identity, more preferably at least 90% identity, yet more 
preferably at least 95% identity, most preferably at least 97-99% identity, to that of SEQ 
ID N0:2 OR 4 over the entire length of SEQ ID N0:2 OR 4. Such polypeptides include 
those comprising the amino acid of SEQ ID N0:2 OR 4. 
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Further peptides of the present invention mclude isolated polypeptides in which the 
amino acid sequence has at least 70% identity, preferably at least 80% identity, more 
preferably at least 90% identity, yet more preferably at least 95% identity, most 
5 preferably at least 97-99% identity, to the amino acid sequence of SEQ ID N0:2 OR 4 
over the entire length of SEQ ID N0:2 OR 4. Such polypeptides include the polypeptide 
ofSEQIDNO:2 OR 4. 

Further peptides of the present invention include isolated polypeptides encoded by a 
10 polynucleotide comprising the sequence contained in SEQ ID NO: 1 . 

The invention also provides an immunogenic fragment of a CASB641 1 polypeptide, that is 
a contiguous portion of the CASB641 1 polypeptide which has the same or similar 
hnmunogenic properties to the polypeptide comprising the amino acid seqeunce of SEQ ID 

15 N0:2 OR 4. That is to say, the fragment (if necessary when coupled to a carrier) is capable 
of raising an inunune response which recognises the CASB641 1 polypeptide. Such an 
immunogenic fragment may include, for example, the CASB641 1 polypeptide lacking an N- 
temiinal leader sequence, a transmembrane domain or a C-teiminal anchor domain. In a 
preferred aspect the immunogenic fragment of CASB641 1 according to the invention 

20 comprises substantially all of the extracellular domain of a polypeptide which has at least 
70% identity, preferably at least 80% identity, more preferably at least 90% identity, yet 
more preferably at least 95% identity, most preferably at least 97-99% identity, to that of 
SEQ ID N0:2 OR 4 over the entire length of SEQ ID N0:2 OR 4. Preferably an 
hnmunogenic fragment according to the invention comprises at least one epitope. 

25 

Peptide fragments incorporating an epitope of CASB641 1 typically will comprise at least 
7, preferably 9 or 10 contiguous amino acids from SEQ ED N0:2 OR 4. Preferred 
epitopes are shown in SEQ ID N0:9 to SEQ ID NO:72. 

30 Peptides that incorporate these epitopes form a preferred aspect of the present invention. 
Mimotopes which have the same characteristics as these epitopes, and immimogens 
comprising such mimotopes which generate an immune response which cross-react with 



2 
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an epitope in the context of the CASB641 1 molecule, also fonn part of the present 
invention. 

The present invention, therefore, includes isolated peptides encompassing these epitopes 
5 themselves, and any mimotope thereof. The meaning of mimotope is defined as an entity 
which is sufficiently similar to the native CASB641 1 epitope so as to be capable of being 
recognised by antibodies which recognise the native molecule; (Gheysen, H.M., et al., 
1986, Synthetic peptides as antigens. Wiley, Chichester, Ciba foundation symposium 
1 19, pl30-149; Gheysen, H.M., 1986, Molecular Immunology, 23,7, 709-715); or are 
10 capable of raising antibodies, when coupled to a suitable carrier, which antibodies cross- 
react with the native molecule. 

Peptide mimotopes of the above-identified epitopes may be designed for a particular 
piupose by addition, deletion or substitution of elected amino acids. Thus, the peptides of 

15 the present invention may be modified for the purposes of ease of conjugation to a protein 
carrier. For example, it may be desirable for some chemical conjugation methods to 
include a terminal cysteine to the epitope. In addition it may be desirable for peptides 
conjugated to a protein carrier to include a hydrophobic terminus distal from the • 
conjugated terminus of the peptide, such that the free unconjugated end of the peptide 

20 remains associated with the surface of the carrier protein. This reduces the 

conforaiational degrees of freedom of the peptide, and thus increases the probability that 
the peptide is presented in a conformation which most closely resembles that of the 
peptide as found in the context of the whole molecule. For example, the peptides may be 
altered to have an N-terminal cysteine and a C-terminal hydrophobic amidated tail. 

25 Alternatively, the addition or substitution of a D-stereoisomer form of one or more of the 
amino acids may be performed to create a beneficial derivative, for example to enhance 
stability of the peptide. Those skilled in the art will realise that such modified peptides, or 
mimotopes, could be a wholly or partly non-peptide mimotope wherein the constituent 
residues are not necessarily confined to the 20 naturally occurring amino acids. In 

30 addition, these may be cyclised by techniques known in the art to constrain the peptide 
into a conformation that closely resembles its shape when the peptide sequence is in the 
context of the whole molecule. A preferred method of cyclising a peptide comprises the 
addition of a pair of cysteine residues to allow the formation of a disulphide bridge. 
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Further, those skilled in the art will realise that mimotopes or immunogens of the present 
invention may be larger than the above-identified epitopes, and as such may comprise the 
sequences disclosed herein. Accordingly, the mimotopes of the present invention may 

5 consist of addition of N and/or C terminal extensions of a number of other natural 

residues at one or both ends. The peptide mimotopes may also be retro sequences of the 
natural sequences, in that the sequence orientation is reversed; or altematively the 
sequences may be entirely or at least in part comprised of D-stereo isomer amino acids 
(inverse sequences). Also, the peptide sequences may be retro-inverso in character, in 

10 that the sequence orientation is reversed and the amino acids are of the D-stereoisomer 
form. Such retro or retro-inverso peptides have the advantage of being non-self, and as 
such may overcome problems of self-tolerance in the immune system. 

Altematively, peptide mimotopes may be identified using antibodies which are capable 
15 themselves of binding to the epitopes of the present invention using techniques such as 
phage display technology (EP 0 552 267 BI). This technique, generates a large number of 
peptide sequences which mimic the structure of the native peptides and are, therefore, 
capable of binding to anti-native peptide antibodies, but may not necessarily themselves 
share significant sequence homology to the native peptide. This approach may have 
20 significant advantages by allowing the possibility of identifying a peptide with enhanced 
immunogenic properties, or may overcome any potential self-antigen tolerance problems 
which may be associated with the use of the native peptide sequence. Additionally this 
technique allows the identification of a recognition pattern for each native-peptide in 
terms of its shared chemical properties amongst recognised mimotope sequences. 

25 

The covalent coupling of the peptide to the immimogenic carrier can be carried out in a 
manner well known in the art. Thus, for example, for direct covalent coupling it is 
possible to utilise a carbodiimide, glutaraldehyde or (N-[y-maleimidobutyryloxy) 
succinimide ester, utilising conunon commercially available heterobifunctional linkers 
30 such as CDAP and SPDP (using manufacturers instructions). After the coupling reaction, 
the immunogen can easily be isolated and purified by means of a dialysis method, a gel 
filtration method, a fi^ctionation method etc. 



4 



wo 01/23417 



PCT/EPOO/09500 



The types of carriers used in the immunogens of the present invention will be readily 
known to the man skilled in the art. The function of the carrier is to provide cytokine help 
in order to help induce an inunune response against the peptide. A non-exhaustive list of 
carriers which may be used in the present invention include: Keyhole limpet 

5 Haemocyanin (KLH), serum albumins such as bovine serum albumin (BSA), inactivated 
bacterial toxins such as tetanus or diptheria toxins (TT and DT), or recombinant 
fragments thereof (for example, Domain 1 of Fragment C of TT, or the translocation 
domain of DT), or the pxirified protein derivative of tuberculin (PPD). Ahematively the 
mimotopes or epitopes may be directly conjugated to liposome carriers, which may 

10 additionally comprise immunogens capable of providing T-cell help. Preferably the ratio 
of mimotopes to carrier is in the order of 1:1 to 20:1, and preferably each carrier should 
carry between 3-15 peptides. 

In an embodiment of the invention a preferred carrier is Protein D from Haemophilus 
15 influenzae (EP 0 594 610 Bl). Protein D is an IgD-binding protein from Haemophilus 
influenzae and has been patented by Forsgren (WO 91/18926, granted EP 0 594 610 Bl). 
In some circumstances, for example in recombinant immunogen expression systems it 
may be desirable to use fragments of protem D, for example Protein D 1/3^** (comprising 
the N-terminal 100-110 amino acids of protein D (GB 9717953.5)). 

20 

Another preferred method of presenting the peptides of the present invention is in the 
context of a recombinant fusion molecule. For example, EP 0 421 635 B describes the use 
of chimaeric hepadnavfrus core antigen particles to present foreign peptide sequences in a 
virus-like particle. As such, inmiunogens of the present invention may comprise peptides 
25 presented in chimaeric particles consisting of hepatitis B core antigen. Additionally, the 
recombinant fusion proteins may comprise the mimotopes of the present invention and a 
carrier protein, such as NSl of the influenza virus. For any recombinantly expressed 
protein which forms part of the present invention, the nucleic acid which encodes said 
immunogen also forms an aspect of the present invention. 

30 

Peptides used in the present invention can be readily synthesised by solid phase 
procedures well known in the art. Suitable syntheses may be performed by utilising "T- 
boc" or "F-moc*' procedures. Cyclic peptides can be synthesised by the solid phase 
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procedure employing the well-known "F-moc" procedure and polyamide resin in the fully 
automated apparatus. Alternatively, those skilled in the art will know the necessary 
laboratory procedures to perform the process manually. Techniques and procedures for 
solid phase synthesis are described in 'Solid Phase Peptide Synthesis: A Practical 

5 Approach' by E. Atherton and R.C. Sheppard, published by IRL at Oxford University 
Press (1989). Alternatively, the peptides may be produced by recombinant methods, 
including expressing nucleic acid molecules encoding the mimotopes in a bacterial or 
mammalian cell line, followed by purification of the expressed mimotope. Techniques for 
recombinant expression of peptides and proteins are known in the art, and are described 

10 in Maniatis, T.. Fritsch, E.F. and Sambrook et al.. Molecular cloning, a laboratory 

manual, 2nd Ed.; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York 
(1989). 

The polypeptides or immunogenic fragment of the invention may be in the form of the 
15 "mature" protein or may be a part of a larger protein such as a precursor or a fusion 
protein. It is often advantageous to include an additional amino acid sequence which 
contains secretory or leader sequences, pro-sequences, sequences which aid in 
purification such as multiple histidine residues, or an additional sequence for stability 
during recombinant production. Furthermore, addition of exogenous polypeptide or lipid 
20 tail or polynucleotide sequences to increase the immunogenic potential of the final 
molecule is also considered. 

In one aspect, the invention relates to genetically engineered soluble fusion proteins 
comprising a polypeptide of the present invention, or a fragment thereof, and various 

25 portions of the constant regions of heavy or Hght chains of immunoglobulins of various 
subclasses (IgG, IgM, IgA, IgE). Preferred as an inmiunoglobulin is the constant part of 
the heavy chain of human IgG, particularly IgGl, where fusion takes place at the hinge 
region. In a particular embodiment, the Fc part can be removed simply by incorporation 
of a cleavage sequence which can be cleaved with blood clotting factor Xa. Furthermore, 

30 this invention relates t6 processes for the preparation of these fusion proteins by genetic 
engineering, and to the use thereof for drug screening, diagnosis and therapy. A fiuther 
aspect of the invention also relates to polynucleotides encoding such fusion proteins. 
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Examples of fusion protein technology can be found in International Patent Application 
Nos. W094/29458 and W094/22914. 

The proteins may be chemically conjugated, or expressed as recombinant fusion proteins 
5 allowing increased levels to be produced m an expression system as compared to non- 
fiised protein. The fusion partner may assist in providing T helper epitopes 
(immunological fusion partner), preferably T helper epitopes recognised by humans, or 
assist in expressing the protein (expression enhancer) at higher yields than the native 
recombinant protein. Preferably the fusion partner will be both an immunological fusion 
10 partner and expression enhancing partner. 

Fusion partners include protein D from Haemophilus influenza B and the non-structural 
protein from influenzae virus, NSl (hemagglutinin). Another immunological fusion 
partner is the protein known as LYTA. Preferably the C terminal portion of the molecule 

15 is used. Lyta is derived from Streptococcus pneumoniae which synthesize an N-acetyl-L- 
alanine amidase, amidase LYTA, (coded by the lytA gene {Gene, 43 (1986) page 265- 
272) an autolysin that specifically degrades certain bonds in the peptidoglycan backbone. 
The C-terminal domain of the LYTA protein is responsible for the affinity to the choline 
or to some choHne analogues such as DEAE. This property has been exploited for the 

20 development of Exoli C-LYTA expressing plasmids useful for expression of fusion 
proteins. Purification of hybrid proteins containing the C-LYTA fragment at its amino 
terminus has been described {Biotechnology: 10, (1992) page 795-798}. It is possible to 
use the repeat portion of the Lyta molecule found in the C terminal end starting at residue 
178, for example residues 188 - 305. 

25 

The present invention also includes variants of the aforementioned polypeptides, that is 
polypeptides that vary from the referents by conservative amino acid substitutions, whereby 
a residue is substituted by another with like characteristics. Typical such substitutions are 
among Ala, Val, Leu and Be; among Ser and Thr, among the acidic residues Asp and Glu; 
30 among Asn and Gin; and among the basic residues Lys and Arg; or aromatic residues Phe 
and Tyr. Particularly preferred are variants in which several, 5-10, 1-5, 1-3, 1-2 or 1 amino 
acids are substituted, deleted, or added in any combination. 
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Polypeptides of the present invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
combination of these methods. Means for preparing such polypeptides are well understood 
5 in the art. 

In a further aspect, tiie present invention relates to CASB641 1 polynucleotides. Such 
polynucleotides include isolated polynucleotides comprising a nucleotide sequence encoding 
a polypeptide which has at least 70% identity, preferably at least 80% identity, more 

10 preferably at least 90% identity, yet more preferably at least 95% identity, to the amino 
acid sequence of SEQ ID N0:2 OR 4, over the entire length of SEQ ID N0:2 OR 4. In this 
regard, polypeptides which have at least 97% identity are highly preferred, whilst tiiose with 
at least 98-99% identity are more highly preferred, and those with at least 99% identity are 
most highly preferred. Such polynucleotides include a polynucleotide comprising the 

1 5 nucleotide sequence contained in SEQ ID NO: 1 or 3 encoding the polypeptide of SEQ ID 
N0:2 OR 4 respectively. 

Further polynucleotides of the present invention include isolated polynucleotides comprising 
a nucleotide sequence that has at least 70% identity, preferably at least 80% identity, more 
20 preferably at least 90% identity, yet more preferably at least 95% identity, to a nucleotide 
sequence encoding a polypeptide of SEQ ED N0:2 OR 4, over the entire coding region. In 
this regard, polynucleotides which have at least 97% identity are highly preferred, whilst 
those with at least 98-99% identity are more highly preferred, and those with at least 99% 
identity are most highly preferred. 

25 

Further polynucleotides of the present invention include isolated polynucleotides 
comprising a nucleotide sequence which has at least 70% identity, preferably at least 80% 
identity, more preferably at least 90% identity, yet more preferably at least 95% identity, 
to SEQ ID NO: 1 or 3 or to the coding sequence of SEQ ID NO: 1 or 3 over the entire 
30 length of SEQ ID NO: 1 or 3 or over the entire length of the coding sequence of SEQ ED 
NO: 1 or 3 respectively. In this regard, polynucleotides which have at least 97% identity are 
highly preferred, whilst those with at least 98-99% identiy are more highly preferred, and 
those with at least 99% identity are most highly preferred. Such polynucleotides include a 

8 
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polynucleotide comprising the polynucleotide of SEQ ID N0:1 or 3 as well as the 
polynucleotide of SEQ ID NO: 1 or 3 or the coding region of SEQ ID N0;1 or 3. Said 
polynucleotide can be inserted in a suitable plasmid or recombinant microrganism vector 
and used for immunization ( see for example Wolff et. al., Science 247:1465-1468 (1990); 
5 Corr et al, J. Exp. Med. 184:1555-1560 (1996); Doe et. al, Proc. Natl Acad. Sci. 93:8578- 
8583 (1996)). 

The invention also provides polynucleotides which are complementary to all the above 
described polynucleotides. 

10 

The invention also provides a fragment of a CASB641 1 polynucleotide which when 
administered to a subject has the same immunogenic properties as the polynucleotide of 
SEQ ID NO: lor 3. 

15 The invention also provides a polynucleotide encoding an immunological fragment of a 
CASB6411 polypeptide as hereinbefore defined. 

The firagments have a level of immunogenic activity of at least about 50%, preferably at 
least about 70% and more preferably at least about 90% of the level of immunogenic 
20 activity of a polypeptide sequence set forth in SEQ ID N0:2 OR 4 or a polypeptide 
sequence encoded by a polynucleotide sequence set forth in SEQ ID NO: 1 . 

The polypeptide fragments according to the invention preferably comprise at least about 
5, 10, 15, 20, 25, 50, or 100 contiguous amino acids, or more, including all intermediate 
25 lengths, of a polypeptide composition set forth herein, such as those set forth in SEQ ID 
N0:2 OR 4, or those encoded by a polynucleotide sequence set forth in a sequence of 
SEQIDNO:!. 

The nucleotide sequence of SEQ ID N0:1 is a cDNA sequence which comprises a 
30 polypeptide encoding sequence (nucleotide 349 to 1 761) encoding a polypeptide of 460 
amino acids, the polypeptide of SEQ ID N0:2. The nucleotide sequence encoding the 
polypeptide of SEQ ID N0:2 may be identical to the polypeptide encoding sequence 
contained in SEQ ID NO: 1 or it may be a sequence other than the one contained in SEQ 

9 
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ID NO: 1 , which, as a result of the redundancy (degeneracy) of the genetic code, also 
encodes the polypeptide of SEQ ID N0:2. The polypeptide of the SEQ ID N0:2 is 
structurally related to other proteins of the LAK-4p family, having homology and/or 
structural similarity with Homo sapiens LAK-4p (GenBank accession BAA24179). 

5 

SEQ ID N0:3 is a cDNA sequence which is an alternative isoform of CASB641 1 SEQ ID 
N0:1, and probably the result of alternative exon splicing. SEQ ID N0:3 comprises a 
polypeptide encoding sequence (nucleotide 382 to 844) encoding a polypeptide of 154 
aminoacids, the polypeptide of SEQ ID N0:4. SEQ ID N0:4 is a truncated form of SEQ ID 
10 N0:2 polypeptide. The nucleotide sequence encoding the polypeptide of SEQ ID N0:4 
may be identical to the polypeptide encoding sequence contained in SEQ ID N0:3 or it 
may be a sequence other than the one contained in SEQ ED N0:3, which, as a result of the 
redundancy (degeneracy) of the genetic code, also encodes the polypeptide of SEQ ID 
N0:4. 

15 

Preferred polypeptides and polynucleotides of the present invention are expected to have, 
inter alia, similar biological functions/properties to their homologous polypeptides and 
polynucleotides. Furthermore, preferred polypeptides, immunological fragments and 
polynucleotides of the present invention have at least one activity of either SEQ ID N0:1 or 
20 3 or SEQ ID N0:2 OR 4, as appropriate. 



The present invention also relates to partial or other incomplete polynucleotide and 
polypeptide sequences which were first identified prior to the determination of the 
corresponding full length sequences of SEQ ID N0:1 or 3 and SEQ ID N0:2 OR 4. 

25 

Accordingly, in a further aspect, the present invention provides for an isolated 
polynucleotide which: 

(a) comprises a nucleotide sequence which has at least 70% identity, preferably at least 
80% identity, more preferably at least 90% identity, yet more preferably at least 95% 

30 identity, even more preferably at least 97-99% identity to SEQ ID N0:5 and 7 over the 
entu-e length of SEQ ID N0:5 and 7, respectively; 

(b) has a nucleotide sequence which has at least 70% identity, preferably at least 80% 
identity, more preferably at least 90% identity, yet more preferably at least 95% identity, 

10 
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even more preferably at least 97-99% identity, to SEQ ID NO: 1 or 3 over the entire 
lengthofSEQIDNO:5and7; 

(c) the polynucleotide of SEQ ID N0:5 and 7; or 

(d) a nucleotide sequence encoding a polypeptide which has at least 70% identity, 
5 preferably at least 80% identity, more preferably at least 90% identity, yet more 

preferably at least 95% identity, even more preferably at least 97-99% identity, to the 
amino acid sequence of SEQ ID N0:6 and 8, over the entire length of SEQ ID N0:6 and 8 , 
respectively,; 

as well as the polynucleotide of SEQ ID N0:5 and 7. 

10 

The present invention further provides for a polypeptide which: 

(a) comprises an amino acid sequence which has at least 70% identity, preferably at least 
80% identity, more preferably at least 90% identity, yet more preferably at least 95% 
identity, most preferably at least 97-99% identity, to that of SEQ ID N0:6 OR 8 over the 

15 entire length of SEQ ID N0:6 and 8; 

(b) has an amino acid sequence which is at least 70% identity, preferably at least 80% 
identity, more preferably at least 90% identity, yet more preferably at least 95% identity, 
most preferably at least 97-99% identity, to the amino acid sequence of SEQ ID N0:6 OR 
8 over the entire length of SEQ ID N0:6 and 8; 

20 (c) comprises the amino acid of SEQ ID N0:6 and 8; and 
(d) is the polypeptide of SEQ ID N0:6 and 8; 

as well as polypeptides encoded by a polynucleotide comprising the sequence contained 
inSEQIDN0:5 and 7. 

25 Polynucleotides of the present invention may be obtained, using standard cloning and 
screening techniques, from a cDNA Ubrary derived from mRNA in cells of human, 
colorectal tumours, stomach tumors and nonnal stomach, normal pancreas and pancreas 
tumors, ovarian tumors, lung tumors, and normal prostate, (for example Sambrook et al., 
Molecular Cloning: A Laboratory Manual, 2"** Ed., Cold Spring harbor Laboratory Press, 

30 Cold Spring harbor, N.Y. (1989)). Polynucleotides of the invention can also be obtained 
from natural sources such as genomic DNA libraries or can be synthesized using well 
known and commercially available techniques. 

11 
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When polynucleotides of the present invention are used for the recombinant production of 
polypeptides of the present invention, the polynucleotide may include the coding sequence 
for the mature polypeptide, by itself; or the coding sequence for the mature polypeptide in 
reading jframe with other coding sequences, such as those encoding a leader or secretory 
5 sequence, a pre-, or pro- or prepro- protein sequence, or other fusion peptide portions. For 
example, a marker sequence which facilitates purification of the fused polypeptide can be 
encoded. In certain preferred embodiments of this aspect of the mvention, the marker 
sequence is a hexa-histidine peptide, as provided in the pQE vector (Qiagen, hic.) and 
described m Gentz et al, Proc Natl Acad Sci USA (1989) 86:821-824, or is an HA tag. The 
10 polynucleotide may also contain non-coding 5' and 3' sequences, such as transcribed, non- 
translated sequences, splicing and polyadenylation signals, ribosome binding sites and 
sequences that stabilize mRNA. 

Further embodiments of the present invention include polynucleotides encoding polypeptide 
1 5 variants which comprise the amino acid sequence of SEQ E) N0:2 OR 4 and in which 
several, for instance from 5 to 10, 1 to 5, 1 to 3, 1 to 2 or 1, amino acid residues are 
substituted, deleted or added, in any combination. 

Polynucleotides which are idratical or sufficiently identical to a nucleotide sequence 
20 contained in SEQ ID NO: 1 or 3, may be used as hybridization probes for cDNA and 

genomic DNA or as primers for a nucleic acid amphfication (PGR) reaction, to isolate full- 
length cDNAs and genomic clones encoding polypeptides of the present invention and to 
isolate cDNA and genomic clones of other genes (including genes encoding paralogs from 
human sources and orthologs and paralogs from species other than human) that have a high 
25 sequence similarity to SEQ ID N0:1 or 3. Typically these nucleotide sequences are 70% 
identical, preferably 80% identical, more preferably 90% identical, most preferably 95% 
identical to that of the referent. The probes or primers will generally comprise at least 1 5 
nucleotides, preferably, at least 30 nucleotides and may have at least 50 nucleotides. 
Particularly preferred probes will have between 30 and 50 nucleotides. Particularly 
30 preferred primers will have between 20 and 25 nucleotides. In particular, polypeptides or 
polynucleotides derived from sequences from homologous animal origin could be used as 
immunogens to obtain a cross-reactive immune response to the human gene. 

12 
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A polynucleotide encoding a polypeptide of the present invention, including homologs from 
species other than human, may be obtained by a process which comprises the steps of 
screening an appropriate library under stringent hybridization conditions with a labeled 
probe having the sequence of SEQ ID NO: 1 or 3 or a fragment thereof; and isolating fiill- 
5 length cDNA and genomic clones containing said polynucleotide sequence. Such 
hybridization techniques are well known to the skilled artisan. Preferred stringent 

hybridization conditions include overnight incubation at 42^C in a solution comprising: 50% 
formamide, 5xSSC (ISOmMNaCl, l5mM trisodium citrate), 50 mM sodium phosphate 
(pH7.6), 5x Denhardfs solution, 10 % dextran sulfate, and 20 microgram/ml denatured, 

10 sheared salmon sperm DNA; followed by washing the filters in 0. Ix SSC at about 65^C. 
Thus the present invention also includes polynucleotides obtainable by screening an 
appropriate library under stingent hybridization conditions with a labeled probe having the 
sequence of SEQ ID NO: 1 or 3 or a fragment thereof 

15 The skilled artisan will appreciate that, in many cases, an isolated cDNA sequence will be 
incomplete, in that the region coding for the polypeptide is short at the 5' end of the 
cDNA. 

There are several methods available and well known to those skilled in the art to obtain 
20 full-length cDNAs, or extend short cDNAs, for example those based on the method of 
Rapid Amplification of cDNA ends (RACE) (see, for example, Frohman et al, PNAS 
USA 85, 8998-9002, 1988). Recent modifications of the technique, exemplified by the 
Marathon*^ technology (Clontech Laboratories Inc.) for example, have significantly 
simplified the search for longer cDNAs. In the Marathon*^ technology, cDNAs have 
25 been prepared from mRNA extracted from a chosen tissue and an 'adaptor* sequence 

ligated onto each end. Nucleic acid amplification (PGR) is then carried out to amplify the 
'missing' 5' end of the cDNA using a combination of gene specific and adaptor specific 
oligonucleotide primers. The PGR reaction is then repeated using 'nested' primers, that is, 
primers designed to aimeal within the amplified product (typically an adaptor specific 
30 primer that anneals further 3' in the adaptor sequence and a gene specific primer that 

aimeals further 5* in the known gene sequence). The products of this reaction can then be 
analysed by DNA sequencing and a full-length cDNA constructed either by joining the 

13 
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product directly to the existing cDNA to give a complete sequence, or carrying out a 
separate full-length PGR using the new sequence information for the design of the 5' 
primer. 

5 Recombinant polypeptides of the present invention may be prepared by processes well 
known in the art from genetically engineered host cells comprising expression systems. 
Accordingly, in a further aspect, the present invention relates to an expression system which 
comprises a polynucleotide of the present invention, to host cells which are genetically 
engineered with such expression sytems and to the production of polypeptides of the 

10 invention by recombinant techniques. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present 
invention. 

For recombinant production, host cells can be genetically engineered to incorporate 
15 expression systems or portions thereof for polynucleotides of the present invention. 

Introduction of polynucleotides into host cells can be effected by methods described in many 
standard laboratory manuals, such as Davis et aL, Basic Methods in Molecular Biology 
(1986) and Sambrook et aly Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989). Preferred such methods 
20 include, for instance, calcium phosphate transfection, DEAE-dextran mediated transfection, 
transvection, microinjection, cationic lipid-mediated transfection, electroporation, 
transduction, scrape loading, ballistic introduction or infection. 

Preferably the proteins of the invention are coexpressed with thioredoxin in trans (TIT). 
25 Coexpression of thioredoxin in trans versus in cis is preferred to keep antigen free of 
thioredoxin without the need for protease. Thioredoxin coexpression eases the 
solubilisation of the proteins of the invention. Thioredoxin coexpression has also a 
significant impact on protein purification yield, on purified-protein solubility and quality, 

30 Representative examples of appropriate hosts include bacterial cells, such as Streptococci, 
Staphylococci, E, coli, Streptomyces and Bacillus subtilis cells; firngal cells, such as yeast 
cells and Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; 

14 
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animal cells such as CHO, COS, HeLa, C127, 3T3, BHK, HEK293 and Bowes melanoma 
cells; and plant cells. 

A great variety of expression systems can be used, for instance, chromosomal, episomal 

5 and virus-derived systems, e.g., vectors derived from bacterial plasmids, from 

bacteriophage, from transposons, from yeast episomes, from insertion elements, from 
yeast chromosomal elements, from viruses such as baculoviruses, papova viruses, such as 
SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and 
retroviruses, and vectors derived from combinations thereof, such as those derived from 

10 plasmid and bacteriophage genetic elements, such as cosmids and phagemids. The 
expression systems may contain control regions that regulate as well as engender 
expression. Generally, any system or vector which is able to maintain, propagate or 
express a polynucleotide to produce a polypeptide in a host may be used. The appropriate 
nucleotide sequence may be mserted into an expression system by any of a variety of 

15 well-known and routine techniques, such as, for example, those set forth in Sambrook et 
ai. Molecular Cloning, A Laboratory Manual (supra). Appropriate secretion signals may 
be incorporated into the desired polypeptide to allow secretion of the translated protein 
into the lumen of the endoplasmic reticulum, the periplasmic space or the extracellular 
environment. These signals may be endogenous to the polypeptide or they may be 

20 heterologous signals. 

The expression system may also be a recombinant live microorganism, such as a virus or 
bacterium. The gene of interest can be inserted into the genome of a live recombinant 
virus or bacterium. Inoculation and in vivo infection with this live vector will lead to in 
25 vivo expression of the antigen and induction of immune responses. 

Therefore, in certain embodiments, polynucleotides encoding immunogenic polypeptides 
of the present invention are introduced into suitable mammalian host cells for expression 
using any of a number of known viral-based systems. In one illustrative embodiment, 
30 retroviruses provide a convenient and effective platform for gene delivery systems. A 
selected nucleotide sequence encoding a polypeptide of the present invention can be 
inserted into a vector and packaged in retroviral particles using techniques known in the 
art. The recombinant virus can then be isolated and delivered to a subject. A number of 

15 
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illustrative retroviral systems have been described (e.g., U.S. Pat. No. 5,219.740; Miller 
and Rosman (1989) BioTechniques 7:980-990; Miller, A. D. (1990) Human Gene 
Therapy 1:5-14; Scarpa et al. (1991) Virology 180:849-852; Bums et al. (1993) Proc. 
Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin (1993) Cur. Opin. 
5 Genet. Develop. 3:102-109. 

In addition, a number of illustrative adenovinis-based systems have also been described. 
Unlike retroviruses which integrate into the host genome, adenoviruses persist 
extrachromosomally thus minimizmg the risks associated with insertional mutagenesis 
10 (Haj-Ahmad and Graham (1986) J. Virol. 57:267-274; Bett et al. (1993) J. Virol. 
67:591 1-5921; Mittereder et al. (1994) Human Gene Therapy 5:717-729; Seth et al 
(1994) J. Virol. 68:933-940; Barr et al. (1994) Gene Therapy 1:51-58; Berkner, K. L. 
(1988) BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 4:461- 
476). 

15 

Various adeno-associated virus (AAV) vector systems have also been developed for 
polynucleotide delivery. AAV vectors can be readily constructed using techniques well 
known in the art. See, e.g., U.S. Pat. Nos. 5,173,414 and 5,139,941; International 
Publication Nos. WO 92/01070 and WO 93/03769; Lebkowski et al. (1988) Molec. Cell. 
20 Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring Harbor Laboratory 
Press); Carter, B. J. (1992) Current Opinion in Biotechnology 3:533-539; Muzyczka, N. 
(1992) Current Topics in Microbiol, and Immunol. 158:97-129; Kotin, R. M. (1994) 
Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene Therapy 1:165-169; 
and Zhou et al. (1994) J. Exp. Med. 179:1867-1875. 

25 Additional viral vectors useful for delivering the nucleic acid molecules encoding 

polypeptides of the present invention by gene transfer include those derived from the pox 
family of viruses, such as vaccinia virus and avian poxvirus. By way of example, vaccinia 
virus recombinants expressing the novel molecules can be constructed as follows. The 
DNA encoding a polypeptide is first inserted into an appropriate vector so that it is 

30 adjacent to a vaccinia promoter and flanking vaccinia DNA sequences, such as the 
sequence encoding thymidine kinase (TK). This vector is then used to transfect cells 

16 
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which are simultaneously infected with vaccinia. Homologous recombination serves to 
insert the vaccinia promoter plus the gene encoding the polypeptide of interest into the 
viral genome. The resulting TK.sup.(-) recombinant can be selected by culturing the cells 
in the presence of 5-bromodeoxyuridine and picking viral plaques resistant thereto. 

5 

A vaccinia-based infection/transfection system can be conveniently used to provide for 
inducible, transient expression or coexpression of one or more polypeptides described 
herein in host cells of an organism. In this particular system, cells are first infected in 
vitro with a vaccinia virus recombinant that encodes the bacteriophage T7 RNA 

10 polymerase. This polymerase displays exquisite specificity in that it only transcribes 
templates bearing T7 promoters. Following infection, cells are transfected with the 
polynucleotide or polynucleotides of interest, driven by a T7 promoter. The polymerase 
expressed in the cytoplasm fi'om the vaccinia virus recombinant transcribes the 
transfected DNA into RNA which is then translated into polypeptide by the host 

15 translational machinery. The method provides for high level, transient, cytoplasmic 

production of large quantities of RNA and its translation products. See, e.g., Eb-oy-Stein 
and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743-6747; Fuerst et al. Proc. Natl. 
Acad. Sci. USA (1986) 83:8122-8126. 

20 Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, can also be used 
to deliver the coding sequences of interest. Recombinant avipox viruses, expressing 
immunogens firom mammalian pathogens, are known to confer protective immunity when 
administered to non-avian species. The use of an Avipox vector is particularly desirable 
in human and other manmialian species since members of the Avipox genus can only 

25 productively replicate in susceptible avian species and therefore are not infective in 
mammalian cells. Methods for producing recombinant Avipoxviruses are known in the 
art and employ genetic recombination, as described above with respect to the production 
of vaccinia viruses. See, e.g., WO 91/12882; WO 89/03429; and WO 92/03545. 



30 



Any of a number of alphavirus vectors can also be used for delivery of polynucleotide 
compositions of the present invention, such as those vectors described in U.S. Patent Nos. 

17 
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5,843,723; 6,015,686; 6,008,035 and 6,015,694. Certain vectors based on Venezuelan 
Equine Encephalitis (VEE) can also be used, illustrative examples of which can be found 
in U.S. Patent Nos. 5,505,947 and 5,643,576. 

5 Moreover, molecular conjugate vectors, such as the adenovirus chimeric vectors 

. described m Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et al. Proc. 
Natl. Acad. Sci. USA (1992) 89:6099-6103. can also be used for gene delivery under the 
invention. 

10 Additional illustrative information on these and other known viral-based delivery systems 
can be found, for example, in Fisher-Hoch et al., Proc. Natl. Acad. Sci, USA 56:317-321, 
1989; Flexner et zUAnn. K Y. Acad. ScL 569:86-103, 1989; Flexner et al.. Vaccine 
5:17-21, 1990; U.S. Patent Nos. 4,603,1 12, 4,769,330, and 5,017,487; WO 89/01973; 
U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; WO 91/02805; Berkner, 

15 Biotechniques 6:616-627, 1988; Rosenfeld et al.. Science 252:431-434, 1991; KoUs et al., 
Proc. Natl Acad, ScL USA 97:215-219, 1994; Kass-Eisler et al., Proc, Natl. Acad Sci. 
USA 90:1 1498-1 1502, 1993; Guzman et al., Circulation 55:2838-2848, 1993; and 
Guzman et al., Cir. Res. 75:1202-1207, 1993. 

20 The recombinant live microorganisms described above can be virulent, or attenuated in 
various ways in order to obtain live vaccines. Such live vaccines also form part of the 
invention. 

25 In certain embodiments, a polynucleotide may be integrated into the genome of a target 
cell. This integration may be in the specific location and orientation via homologous 
recombination (gene replacement) or it may be integrated in a random, non-specific 
location (gene augmentation). In yet further embodiments, the polynucleotide may be 
stably maintained in the cell as a separate, episomal segment of DNA. Such 

30 polynucleotide segments or "episomes" encode sequences sufficient to permit 

maintenance and replication independent of or in synchronization with the host cell cycle. 
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The manner in which the expression construct is delivered to a cell and where in the cell 
the polynucleotide remains is dependent on the type of expression construct employed. 

In another embodiment of the invention, a polynucleotide is administered/deUvered as 
5 "naked" DNA, for example as described in Ulmer et al., Science 259: 1 745-1 749, 1 993 
and reviewed by Cohen, Science 259:1691-1692, 1993. The uptake of naked DNA may 
be increased by coating the DNA onto biodegradable beads, which are efficiently 
transported into the cells. 

10 In still another embodiment, a composition of the present invention can be delivered via a 
particle bombardment approach, many of which have been described. In one illustrative 
example, gas-driven particle acceleration can be achieved with devices such as those 
manufactured by Powderject Pharmaceuticals PLC (Oxford, UK) and Powdetject 
Vaccines Inc. (Madison, WI), some examples of which are described in U.S. Patent Nos. 

15 5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 799. This approach 
offers a needle-free delivery approach wherein a dry powder formulation of microscopic 
particles, such as polynucleotide or polypeptide particles, are accelerated to high speed 
within a helium gas jet generated by a hand held device, propelling the particles into a 
target tissue of interest. 

20 

In a related embodiment, other devices and methods that may be useful for gas-driven 
needle-less injection of compositions of the present invention include those provided by 
Bioject, Inc. (Portland, OR), some examples of which are described in U.S. Patent Nes. 
4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 and 5,993,412. 

25 

Polypeptides of the present invention can be recovered and purified from recombinant cell 
cultures by well-known methods including ammonium sidfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose chromatography, 
hydrophobic interaction chromatography, afiBnity chromatography, hydroxylapatite 
30 chromatography and lectin chromatography. Most preferably, ion metal affinity 
chromatography (MAC) is employed for purification. Well known techniques for 
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refolding proteins may be employed to regenerate active conformation when the polypeptide 
is denatured during intracellular synthesis, isolation and or purification. 

Another important aspect of the invention relates to a method for inducing , re-inforcing 
5 or modulating an immunological response in a mammal which comprises inoculating the 
mammal with a fi-agment or the entire polypeptide or polynucleotide of the invention, 
adequate to produce antibody and/or T cell immune response for prophylaxis or for 
therapeutic treatment of cancer and autoimmune disease and related conditions. Yet 
another aspect of the invention relates to a method of inducing, re-inforcing or 
10 modulating immunological response in a mammal which comprises, delivering a 
polypeptide of the present invention via a vector or cell directing expression of the 
polynucleotide and coding for the polypeptide in vivo in order to induce such an 
immunological response to produce immune responses for prophylaxis or treatment of 
said mammal fi"om diseases. 

15 

A further aspect of the invention relates to an immunological/vaccine formulation 
(composition) which, when introduced into a mammalian host, induces, re-inforces or 
modulates an immunological response in that mammal to a polypeptide of the present 
invention wherein the composition comprises a polypeptide or polynucleotide of the 

20 invention or an immunological fragment thereof as herein before defined.The vaccine 
formulation may further comprise a suitable carrier. Since a polypeptide may be broken 
down in the stomach, it is preferably administered parenterally (for instance, 
subcutaneous, intramuscular, intravenous, or intradermal injection). Formulations 
suitable for parenteral administration include aqueous and non-aqueous sterile injection 

25 solutions which may contain anti-oxidants, buffers, bacteriostats and solutes which render 
the formulation isotonic with the blood of the recipient; and aqueous and non-aqueous 
sterile suspensions which may include suspending agents or thickening agents. The 
formulations may be presented in unit-dose or multi-dose containers, for example, sealed 
ampoules and vials and may be stored in a freeze-dried condition requiring only the 

30 addition of the sterile liquid carrier immediately prior to use. 

A further aspect of the invention relates to the in vitro induction of immune responses to a 
fragment or the entire polypeptide or polynucleotide of the present invention or a 

20 
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molecule comprising the polypeptide or polynucleotide of the present invention, using 
cells from the immune system of a mammal, and reinfusing these activated immune cells 
of the manunal for the treatment of disease. Activation of the cells from the immune 
system is achieved by in vitro incubation with the entire polypeptide or polynucleotide of 

5 the present invention or a molecule comprising the polypeptide or polynucleotide of the 
present invention in the presence or absence of various immunomodulator molecules. 
A further aspect of the invention relates to the immunization of a mammal by 
administration of antigen presenting cells modified by in vitro loading with part or the 
entire polypeptide of the present invention or a molecule comprising the polypeptide of 

10 the present invention and administered in vivo in an immunogenic way. Alternatively, 
antigen presenting cells can be transfected in vitro with a vector containing a fragment or 
the entire polynucleotide of the present invention or a molecule comprising the 
polynucleotide of the present invention, such as to express the corresponding polypeptide, 
and administered in vivo in an immunogenic way. 
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According to another embodiment, the pharmaceutical compositions described herein will 
comprise one or more immunostimulants in addition to the immunogenic polynucleotide, 
polypeptide, antibody, T-cell and/or antigen presenting cell (APC) compositions of this 
invention. An inununostimulant refers to essentially any substance that enhances or 

5 potentiates an immune response (antibody and/or cell-mediated) to an exogenous antigen. 
One preferred type of immunostimulant comprises an adjuvant. Many adjuvants contain 
a substance designed to protect the antigen from rapid catabolism, such as aluminum 
hydroxide or mineral oil, and a stimulator of immune responses, such as lipid A, 
Bortadella pertussis or Mycobacterium tuberculosis derived proteins. Certain adjuvants 

10 are commercially available as, for example, Freund's Incomplete Adjuvant and Complete 
Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 65 (Merck and Company, 
Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia, PA); aluminum sahs such 
as aluminum hydroxide gel (alum) or aluminum phosphate; salts of calcium, iron or zinc; 
an insoluble suspension of acylated tyrosine; acylated sugars; cationically or anionically 

1 5 derivatized polysaccharides; polyphosphazenes; biodegradable microspheres; 

monophosphoryl lipid A and quil A. Cytokines, such as GM-CSF, interleukin-2. -7, -12, 
and other like growth factors, may also be used as adjuvants. 

Within certain embodiments of the invention, the adjuvant composition is preferably one 
20 that induces an immune response predominantly of the Thl type. High levels of Thl - 
type cytokines (e.g., IFN-y, TNFa, IL-2 and IL-12) tend to favor the induction of cell 
mediated immime responses to an administered antigen. In contrast, high levels of Th2- 
type cytokines {e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the mduction of humoral 
immune responses. Following application of a vaccine as provided herein, a patient will 
25 support an immune response that includes Thl - and Th2-type responses. Within a 

prefened embodiment, in which a response is predominantly Thl-type, the level of Thl - 
type cytokines will increase to a greater extent than the level of Th2-type cytokines. The 
levels of these cytokines may be readily assessed using standard assays. For a review of 
the famiUes of cytokines, see Mosmann and Coffinan, Ann. Rev. Immunol. 7:145-173, 
30 1989. 
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Certain preferred adjuvants for eliciting a predominantly Thl-type response include, for 
example, a combination of monophosphoryl lipid A, preferably 3-de-O-acylated 
monophosphoryl lipid A, together with an aluminum salt. MPL® adjuvants are available 
from Corixa Corporation (Seattle, WA; see, for example, US Patent Nos. 4,436,727; 

5 4,877,6 1 1 ; 4,866,034 and 4,9 12,094). CpG-containing oligonucleotides (in which the 
CpG dinucleotide is unmethylated) also induce a predominantly Thl response. Such 
oUgonucleotides are well known and are described, for example, in WO 96/02555, WO 
99/33488 and U.S. Patent Nos. 6,008,200 and 5,856,462. Iramunostimulatory DNA 
sequences are also described, for example, by Sato et al.. Science 273:352, 1996. 

10 Another preferred adjuvant comprises a saponin, such as Quil A, or derivatives thereof, 
including QS21 and QS7 (Aquila Biopharmaceuticals Inc., Framingham, MA); Escin; 
Digitonin; or Gypsophila or Chenopodium quinoa saponins . Other preferred 
formulations include more than one saponin in the adjuvant combinations of the present 
invention, for example combinations of at least two of the following group comprising 

15 QS21, QS7, Quil A, P-escin, or digitonin. 



Altematively the saponin formulations may be combined with vaccine vehicles composed 
of chitosan or other polycationic polymers, polylactide and polylactide-co-glycolide 
particles, poly-N-acetyl glucosamine-based polymer matrix, particles composed of 

20 polysaccharides or chemically modified polysaccharides, liposomes and lipid-based 
particles, particles composed of glycerol monoesters, etc. The saponins may also be 
formulated in the presence of cholesterol to form particulate structures such as liposomes 
or ISCOMs. Furthermore, the saponins may be formulated together with a 
polyoxyethylene ether or ester, in either a non-particulate solution or suspension, or m a 

25 particulate structure such as a paucilamelar liposome or ISCOM. The saponins may also 
be formulated with excipients such as Carbopol^ to increase viscosity, or may be 
formulated in a dry powder form with a powder excipient such as lactose. 

In one preferred embodiment, the adjuvant system includes the combination of a 
monophosphoryl Upid A and a saponin derivative, such as the combination of QS21 and 
30 3D-MPL® adjuvant, as described in WO 94/00153, or a less reactogenic composition 
where the QS21 is quenched with cholesterol, as described in WO 96/33739. Other 
preferred formulations comprise an oil-in-water emulsion and tocopherol. Another 
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particularly preferred adjuvant formulation employing QS21 , 3D-MPL adjuvant and 
tocopherol in an oil-in-water emulsion is described in WO 95/17210. 

Another enhanced adjuvant system involves the combination of a CpG-containing 
5 oligonucleotide and a saponin derivative particularly the combination of CpG and QS21 
as disclosed in WO 00/09159. Preferably the formulation additionally comprises an oil in 
water emulsion and tocopherol. 

Additional illustrative adjuvants for use in the pharmaceutical compositions of the 
10 invention include Montanide ISA 720 (Seppic, France), SAF (Chiron, California, United 
States), ISCOMS (CSL), MF-59 (Chiron), the SB AS series of adjuvants (e.g.. SBAS-2 or 
SBAS-4, available from SmithKline Beecham, Rixensart, Belgium), Detox (Enhanzyn^ 
(Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton, MT) and other aminoalkyl 
glucosaminide 4-phosphates (AGPs), such as those described in pending U.S. Patent 
15 Application Serial Nos. 08/853,826 and 09/074,720, the disclosures of which are 

incorporated herein by reference in their entireties, and polyoxyethylene ether adjuvants 
such as those described in WO 99/52549A1 . 

Other preferred adjuvants include adjuvant molecules of the general formula (I): 
HO(CH2CH20)„-A.R 

20 Wherein, n is 1-50, A is a bond or -C(0)-, R is C1.50 alkyl or Phenyl C1.50 alkyl. 

One embodiment of the present invention consists of a vaccine formulation comprising a 
polyoxyethylene ether of general formula (I), wherein n is between 1 and 50, preferably 
4-24, most preferably 9; the R component is C1.50, preferably C4-C20 alkyl and most 
preferably C12 alkyl, and .4 is a bond. The concentration of the polyoxyethylene ethers 
25 should be in the range 0.1-20%, preferably from 0.1-10%, and most preferably in the 
range 0.1-1%. Preferred polyoxyethylene ethers are selected from the following group: 
polyoxyethylene-9-lauryl ether, polyoxyethylene-9-steoryl ether, polyoxyethylene-8- 
steoryl ether, polyoxyethylene-4-lauryl ether, polyoxyethylene-35-lauryl ether, and 
polyoxyethylene-23-lauryl ether. Polyoxyethylene ethers such as polyoxyethylene lauryl 
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ether are described in the Merck index (12^ edition: enfry 7717). These adjuvant 
molecules are described in WO 99/52549. 

The polyoxyethylene ether according to the general formula (I) above may, if desired, be 
5 combined with another adjuvant. For example, a preferred adjuvant combination is 
preferably with CpG as described in the pending UK patent application GB 9820956.2. 



Preferably a carrier is also present in the vaccine composition according to the invention. 
10 The carrier may be an oil in water emulsion, or an aluminium salt, such as aluminium 
phosphate or aluminium hydroxide. 

A preferred oil-in-water emulsion comprises a metaboHsible oil, such as squalene, alpha 
tocopherol and Tween 80. In a particularly preferred aspect the antigens in the vaccine 
15 composition according to the invention are combined with QS21 and 3D-MPL in such an 
emulsion. Additionally the oil in water emulsion may contain span 85 and/or lecithin 
and/or tricaprylin. 

Typically for human administration QS21 and 3D-MPL will be present in a vaccine in the 
20 range of 1 |ig - 200jig, such as 1 0- 1 00|ig, preferably 1 O^ig - 50^g per dose. Typically the 
oil in water will comprise from 2 to 10% squalene, from 2 to 10% alpha tocopherol and 
from 0.3 to 3% tween 80. Preferably the ratio of squalene: alpha tocopherol is equal to 
or less than 1 as this provides a more stable emulsion. Span 85 may also be present at a 
level of 1%. In some cases it may be advantageous that the vaccines of the present 
25 invention will further contain a stabiliser. 

Non-toxic oil in water emulsions preferably contain a non-toxic oil, e.g. squalane or 
squalene, an emulsifier, e.g. Tween 80, in an aqueous carrier. The aqueous carrier may 
be, for example, phosphate buffered saline. 

30 
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A particularly potent adjuvant formulation involving QS21, 3D-MPL and tocopherol in 
an oil in water emulsion is described in WO 95/17210. 

The present invention also provides a polyvalent vaccine composition comprising a vaccine 
5 fomiulation of the invention in combination with other antigens, in particular antigens useful 
for treating cancers, autoimmxme diseases and related conditions. Such a polyvalent vaccine 
composition may include a TH-1 inducing adjuvant as hereinbefore described. 

According to another embodiment of this invention, an immunogenic composition 
10 described herein is delivered to a host via antigen presenting cells (APCs), such as 

dendritic cells, macrophages, B cells, monocytes and other cells that may be engineered 
to be efficient APCs. Such cells may, but need not, be genetically modified to increase 
the capacity for presenting the antigen, to improve activation and/or maintenance of the T 
cell response, to have anti-tumor effects per se and/or to be immunologically compatible 
15 with the receiver (/.e.» matched HLA haplotype). APCs may generally be isolated fi*om 
any of a variety of biological fluids and organs, including tumor and peritumoral tissues, 
and may be autologous, allogeneic, syngeneic or xenogeneic cells. 

Certain preferred embodiments of the present invention use dendritic cells or progenitors 
20 thereof as antigen-presenting cells. Dendritic cells are highly potent APCs (Banchereau 
and Steinman, Nature 392:245-25 1 , 1998) and have been shown to be effective as a 
physiological adjuvant for eliciting prophylactic or therapeutic antitumor immunity {see 
Timmerman and Levy,i4nw. Rev, Med. 50:507-529, 1999). In general, dendritic cells 
may be identified based on their typical shape (stellate in situ, with marked cytoplasmic 
25 processes (dendrites) visible in vitro), their ability to take up, process and present 
antigens with high efficiency and their ability to activate naive T cell responses. 
Dendritic cells may, of course, be engineered to express specific cell-surface receptors or 
ligands that are not commonly found on dendritic cells in vivo or ex vzvo, and such 
modified dendritic cells are contemplated by the present invention. As an alternative to 
30 dendritic cells, secreted vesicles antigen-loaded dendritic cells (called exosomes) may be 
used within a vaccine (see Zitvogel et al., Nature Med. ^:594-600, 1998). 
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Dendritic cells and progenitors may be obtained from peripheral blood, bone marrow, 
tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph nodes, spleen, skin, 
umbilical cord blood or any other suitable tissue or fluid. For example, dendritic cells 
may be differentiated ex vivo by adding a combination of cytokines such as GM-CSF, IL- 

5 4, EL- 13 and/or TNFa to cultures of monocytes harvested from peripheral blood. 

Alternatively, CD34 positive cells harvested from peripheral blood, umbilical cord blood 
or bone marrow may be differentiated into dendritic cells by adding to the culture 
medium combinations of GM-CSF, IL-3, TNFa, CD40 ligand, LPS, flt3 ligand and/or 
other compound(s) that induce differentiation, maturation and proliferation of dendritic 

10 cells. 

Dendritic cells are conveniently categorized as "inmiature" and "mature" cells, which 
allows a simple way to discriminate between two well characterized phenotypes. 
However, this nomenclature should not be construed to exclude all possible intermediate 
stages of differentiation. Immature dendritic cells are characterized as APC with a high 

15 capacity for antigen uptake and processing, which correlates with the high expression of 
Fey receptor and mannose receptor. The mature phenotype is typically characterized by a 
lower expression of these markers, but a high expression of cell surface molecules 
responsible for T cell activation such as class I and class 11 MHC, adhesion molecules 
(e.g., CD54 and GDI 1) and costimulatory molecules (e.g., CD40, CD80, CD86 and 4- 

20 IBB). 

APCs may generally be transfected with a polynucleotide of the invention (or portion or 
other variant thereof) such that the encoded polypeptide, or an immunogenic portion 
thereof, is expressed on the cell surface. Such transfection may take place ex vivo, arid a 

25 phannaceutical composition comprising such transfected cells may then be used for 
therapeutic purposes, as described herein. Alternatively, a gene delivery vehicle that 
targets a dendritic or other antigen presenting cell may be administered to a patient, 
resulting in transfection that occurs in vivo. In vivo and ex vivo transfection of dendritic 
cells, for example, may generally be performed using any methods known in the art, such 

30 as those described in WO 97/24447, or the gene gun approach described by Mahvi et al.. 
Immunology and cell Biology 75:456-460, 1997. Antigen loading of dendritic cells may 
be achieved by incubating dendritic cells or progenitor cells with the tumor polypeptide, 
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DNA (naked or within a plasmid vector) or RNA; or with antigen-expressing 
recombinant bacterium or viruses {e.g., vaccinia, fowlpox, adenovirus or lentivirus 
vectors). Prior to loading, the polypeptide may be covalently conjugated to an 
immunological partner that provides T cell help {e.g., a carrier molecule). Alternatively, 
5 a dendritic cell may be pulsed with a non-conjugated immxmological partner, separately 
or in the presence of the polypeptide. 

While any suitable carrier known to those of ordinary skill in the art may be employed in 
the pharmaceutical compositions of this invention, the type of carrier will typically vary 
10 depending on the mode of administration. Compositions of the present invention may be 
formulated for any appropriate manner of administration, including for example, topical, 
oral, nasal, mucosal, intravenous, intracranial, intraperitoneal, subcutaneous and 
intramuscular administration. 

Carriers for use within such pharmaceutical compositions are biocompatible, and may 
15 also be biodegradable. In certain embodiments, the formulation preferably provides a 
relatively constant level of active component release. In other embodiments, however, a 
more rapid rate of release immediately upon administration may be desired. The 
formulation of such compositions is well within the level of ordinary skill in the art using 
known techniques. Illustrative carriers useful m this regard include microparticles of 
20 poly(lactide-co-glycolide), polyacrylate, latex, starch, cellulose, dextran and the Uke. 
Other illustrative delayed-release carriers include supramolecular biovectors, which 
comprise a non-liquid hydrophilic core {e.g., a cross-linked polysaccharide or 
oligosaccharide) and, optionally, an external layer comprising an amphiphilic compound, 
such as a phospholipid {see e.g,, U.S. Patent No. 5,151,254 and PCT applications WO 
25 94/20078, WO/94/2370 1 and WO 96/06638). The amount of active compound contained 
within a sustained release formulation depends upon the site of implantation, the rate and 
expected duration of release and the nature of the condition to be treated or prevented. 

In another illustrative embodiment, biodegradable microspheres {e.g., polylactate 
30 polyglycolate) are employed as carriers for the compositions of this invention. Suitable 
biodegradable microspheres are disclosed, for example, in U.S. Patent Nos. 4,897,268; 
5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 5,814,344, 5,407,609 and 
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5,942,252. Modified hepatitis B core protein earner systems, such as described in 
WO/99 40934, and references cited therein, will also be useful for many applications. 
Another illustrative carrier/delivery system employs a carrier comprising particulate- 
protein complexes, such as those described in U.S. Patent No. 5,928,647, which are 
5 capable of inducing a class I-restricted cytotoxic T lymphocyte responses in a host. 

The pharmaceutical compositions of the invention will often further comprise one or 
more buffers {e.g., neutral buffered saline or phosphate buffered saline), carbohydrates 
(e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, polypeptides or amino 
acids such as glycine, antioxidants, bacteriostats, chelating agents such as EDTA or 
10 glutathione, adjuvants (e.g., aliuninum hydroxide), solutes that render the formulation 
isotonic, hypotonic or weakly hypertonic with the blood of a recipient, suspending agents, 
thickening agents and/or preservatives. Alternatively, compositions of the present 
invention may be formulated as a lyophilizate. 

15 The pharmaceutical compositions described herein may be presented in unit-dose or 
multi-dose containers, such as sealed ampoules or vials. Such containers are typically 
sealed in such a way to preserve the sterility and stabiUty of the formulation until use. In 
general, formulations may be stored as suspensions, solutions or emulsions in oily or 
aqueous vehicles. Alternatively, a pharmaceutical composition may be stored in a freeze- 

20 dried condition requiring only the addition of a sterile liquid carrier immediately prior to 
use. 

The development of suitable dosing and treatment regimens for using the particular 
compositions described herein in a variety of treatment regimens, including e.g., oral, 
25 parenteral, intravenous, intranasal, and intramuscular administration and formulation, is 
well known in the art, some of which are briefly discussed below for general purposes of 
illustration. 

In certain applications, the pharmaceutical compositions disclosed herein may be 
30 delivered via oral administration to an animal. As such, these compositions may be 
formulated with an inert diluent or with an assimilable edible carrier, or they may be 
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enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into tablets, or 
they may be incorporated directly with the food of the diet. 

The active compounds may even be incorporated with excipients and used in the form of 

5 ingestible tablets, buccal tables, troches, capsules, elixirs, suspensions, syrups, wafers, 
and the like (see, for example, Mathiowitz et a/., Nature 1997 Mar 27;386(6623):410-4; 
Hwang et al, Crit Rev Ther Drug Carrier Syst 1 998; 1 5(3):243-84; U. S. Patent 
5,641,515; U. S. Patent 5,580,579 and U. S. Patent 5,792,451). Tablets, troches, pills, 
capsules and the like may also contain any of a variety of additional components, for 

10 example, a binder, such as gum tragacanth, acacia, cornstarch, or gelatin; excipients, such 
as dicalcium phosphate; a disintegrating agent, such as com starch, potato starch, alginic 
acid and the like; a lubricant, such as magnesium stearate; and a sweetening agent, such 
as sucrose, lactose or saccharin may be added or a flavoring agent, such as peppermint, 
oil of wintergreen, or cherry flavoring. When the dosage unit form is a capsule, it may 

15 contain, in addition to materials of the above type, a liquid carrier. Various other 
materials may be present as coatings or to otherwise modify the physical form of the 
dosage unit. For instance, tablets, pills, or capsules may be coated with shellac, sugar, or 
both. Of course, any material used in preparing any dosage unit form should be 
pharmaceutically pure and substantially non-toxic in the amounts employed. In addition, 

20 the active compounds may be incorporated into sustained-release preparation and 
formulations. 

Typically, these formulations will contain at least about 0.1% of the active compound or 
more, although the percentage of the active ingredient(s) may, of course, be varied and 

25 may conveniently be between about 1 or 2% and about 60% or 70% or more of the 

weight or volume of the total formulation. Naturally, the amount of active compound(s) 
in each therapeutically useful composition may be prepared is such a way that a suitable 
dosage will be obtained in any given unit dose of the compound. Factors such as 
solubility, bioavailability, biological half-life, route of administration, product shelf life, 

30 as well as other pharmacological considerations will be contemplated by one skilled in 
the art of preparing such pharmaceutical formulations, and as such, a variety of dosages 
and treatment regimens may be desirable. 
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For oral administration the compositions of the present invention may alternatively be 
incorporated with one or more excipients in the form of a mouthwash, dentifrice, buccal 
tablet, oral spray, or sublingual orally-administered formulation. Alternatively, the active 

5 ingredient may be incorporated into an oral solution such as one containing sodium 
borate, glycerin and potassium bicarbonate, or dispersed in a dentifrice, or added in a 
therapeutically-effective amount to a composition that may include water, binders, 
abrasives, flavoring agents, foaming agents, and hxmiectants. Alternatively the 
compositions may be fashioned into a tablet or solution form that may be placed under 

10 the tongue or otherwise dissolved in the mouth. 



In certain circumstances it will be desirable to deliver the pharmaceutical compositions 
disclosed herein parenterally, intravenously, intramuscularly, or even intraperitoneally. 
Such approaches are well known to the skilled artisan, some of which are further 

15 described, for example, in U. S. Patent 5,543,158; U. S. Patent 5,641,515 and U. S. Patent 
5,399,363. In certain embodiments, solutions of the active compounds as free base or 
phamiacologically acceptable salts may be prepared in water suitably mixed with a 
surfactant, such as hydroxypropylcellulose. Dispersions may also be prepared in 
glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. Under ordinary 

20 conditions of storage and use, these preparations generally will contain a preservative to 
prevent the growth of microorganisms. 



Illustrative pharmaceutical forms suitable for injectable use include sterile aqueous 
solutions or dispersions and sterile powders for the extemporaneous preparation of sterile 

25 injectable solutions or dispersions (for example, see U. S. Patent 5,466,468). In all cases 
the form must be sterile and must be fluid to the extent that easy syringability exists. It 
must be stable under the conditions of manufacture and storage and must be preserved 
against the contaminating action of microorganisms, such as bacteria and fimgi. The 
carrier can be a solvent or dispersion medium containing, for example, water, ethanol, 

30 polyol (e.g., glycerol, propylene glycol, and liquid polyethylene glycol, and the like), 
suitable mixtures thereof, and/or vegetable oils. Proper fluidity may be maintained, for 
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example, by the use of a coating, such as lecithin, by the maintenance of the required 
particle size in the case of dispersion and/or by the use of surfactants. The prevention of 
the action of microorganisms can be facilitated by various antibacterial and antifungal 
agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the 
5 like. In many cases, it will be preferable to include isotonic agents, for example, sugars 
or sodium chloride. Prolonged absorption of the injectable compositions can be brought 
about by the use in the compositions of agents delaying absorption, for example, 
aliuninum monostearate and gelatin. 



10 In one embodiment, for parenteral administration in an aqueous solution, the solution 
should be suitably buffered if necessary and the liquid diluent first rendered isotonic with 
sufficient saline or glucose. These particular aqueous solutions are especially suitable for 
intravenous, intramuscular, subcutaneous and intraperitoneal administration. In this 
connection, a sterile aqueous medium that can be employed will be known to those of 

15 skill in the art in light of the present disclosure. For example, one dosage may be 
dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml of 
hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, 
"Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570-1580). 
Some variation in dosage will necessarily occur depending on the condition of the subject 

20 being treated. Moreover, for human administration, preparations will of course 

preferably meet sterility, pyrogenicity, and the general safety and purity standards as 
required by FDA Office of Biologies standards. 



In another embodiment of the invention, the compositions disclosed herein may be 
25 formulated in a neutral or salt form. Illustrative pharmaceutically-acceptable salts include 
the acid addition salts (formed with the jfree amino groups of the protein) and which are 
fonned with inorganic acids such as, for example, hydrochloric or phosphoric acids, or 
such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the 
firee carboxyl groups can also be derived fi-om inorganic bases such as, for example, 
30 sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as 
isopropylamine, trimethylamine, histidine, procaine and the like. Upon formulation, 
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solutions will be administered in a manner compatible with the dosage formulation and in 
such amount as is therapeutically effective. 

The carriers can further comprise any and all solvents, dispersion media, vehicles, 
5 coatings, diluents, antibacterial and antifungal agents, isotonic and absorption delaying 
agents, buffers, carrier solutions, suspensions, colloids, and the like. The use of such 
media and agents for pharmaceutical active substances is well known in the art. Except 
insofar as any conventional media or agent is incompatible with the active ingredient, its 
use in the therapeutic compositions is contemplated. Supplementary active ingredients 
10 can also be incorporated into the compositions. The phrase 

"pharmaceutically-acceptable" refers to molecular entities and compositions that do not 
produce an allergic or similar untoward reaction when administered to a human. 

In certain embodiments, the pharmaceutical compositions may be delivered by intranasal 
15 sprays, inhalation, and/or other aerosol delivery vehicles. Methods for delivering genes, 
nucleic acids, and peptide compositions directly to the lungs via nasal aerosol sprays has 
been described, e.g., in U. S. Patent 5,756,353 and U. S. Patent 5,804,212. Likewise, the , 
delivery of drugs using intranasal microparticle resins (Takenaga et aL, J Controlled 
Release 1998 Mar 2;52(l-2):81-7) and lysophosphatidyl-glycerol compounds (U. S. 
20 Patent 5,725,871) are also well-known in tiie pharmaceutical arts. Likewise, illustrative 
transmucosal drug delivery in the form of a polytetrafluoroetheylene support matrix is 
described in U. S. Patent 5,780,045. 

In certain embodiments, liposomes, nanocapsules, microparticles, lipid particles, vesicles, 
25 and the like, are used for the introduction of the compositions of the present invention 
into suitable host cells/organisms. In particular, the compositions of the present invention 
may be formulated for delivery either encapsulated in a lipid particle, a Hposome, a 
vesicle, a nanosphere, or a nanoparticle or the like. Alternatively, compositions of the 
present invention can be bound, either covalentiy or non-covalently, to the surface of 
30 such carrier vehicles. 
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The formation and use of liposome and liposome-like preparations as potential drug 
carriers is generally known to those of skill in the art (see for example, Lasic, Trends 
Biotechnol 1998 Jul;16(7):307-21; Takakura, Nippon Rinsho 1998 Mar;56(3):691-5; 
Chandran et aL, Indian J Exp Biol. 1997 Aug;35(8):801.9; Margalit, Crit Rev Ther Drug 
5 Carrier Syst. 1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S. Patent 5,552,157; U.S. 
Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent 5,795,587, each specifically 
incorporated herein by reference in its entirety). 

Liposomes have been used successfully with a number of cell types that are normally 
10 difficult to transfect by other procedures, including T cell suspensions, primary 
hepatocyte cultures and PC 12 cells (Renneisen et aL, J Biol Chem. 1990 Sep 
25,265(27): 16337-42; Muller et aL, DNA Cell Biol 1990 Apr;9(3):221-9). In addition, 
liposomes are fi-ee of the DNA length constraints that are typical of viral-based delivery 
systems. Liposomes have been used effectively to introduce genes, various drugs, 
15 radiotherapeutic agents, enzymes, viruses, transcription factors, allosteric effectors and 
the like, into a variety of cultured cell lines and animals. Furthermore, he use of 
liposomes does not appear to be associated with autoimmune responses or unacceptable 
toxicity after systemic delivery. 

20 In certain embodiments, liposomes are formed fi-om phospholipids that are dispersed in 
an aqueous medium and spontaneously form multilamellar concentric bilayer vesicles 
(also termed multilamellar vesicles (MLVs). 

Altematively, in other embodiments, the invention provides for pharmaceutically- 
25 acceptable nanocapsule formulations of the compositions of the present invention. 

Nanocapsules can generally entrap compounds in a stable and reproducible way (see, for 
example, Quintanar-Guerrero et aL, Drug Dev Ind Pharm. 1998 Dec;24(12): 11 13-28). 
To avoid side effects due to intracellular polymeric overioading, such ultrafine particles 
(sized around 0.1 ^m) may be designed using polymers able to be degraded in vivo. Such 
30 particles can be made as described, for example, by Couvreur et aL, Crit Rev Ther Drug 
Carrier Syst. 1988;5(l):l-20; zur Muhlen et aL, Eur J Pharm Biopharm. 1998 
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Mar;45(2): 149-55; Zambaux et al J Controlled Release. 1998 Jan 2;50(l-3):31-40; and 
U.S. Patent 5,145,684. 

This invention also relates to the use of polynucleotides, in the form of primers derived from 
5 the polynucleotides of the present invention, and of polypeptides, in the form of antibodies 
or reagents specific for the polypeptide of the present invention, as diagnostic reagents. 

The identification of genetic or biochemical markers in blood or tissues that will enable the 
detection of very early changes along the carcinogenesis pathway will help in determining 

10 the best treatment for the patient. Surrogate tumour markers, such as polynucleotide 

expression, can be used to diagnose different forms and states of cancer. The identification 
of expression levels of the polynucleotides of the invention will be usefiil in both the 
staging of the cancerous disorder and grading the nature of the cancerous tissue. The staging 
process monitors the advancement of the cancer and is determined on the presence or 

15 absence of malignant tissue m the areas biopsied. The polynucleotides of the invention can 
help to perfect the staging process by identifying markers for the aggresivity of a cancer, for 
example the presence in different areas of the body. The grading of the cancer describes 
how closely a tumour resembles normal tissue of its same type and is assessed by its cell 
morphology and other markers of differentiation. The polynucleotides of the invention can 

20 be useful in determining the tumour grade as they can help in the determination of the 
differentiation status of the cells of a tumour. 

The diagnostic assays offer a process for diagnosing or determining a susceptibility to 
cancers, autoimmime disease and related conditions through diagnosis by methods 

25 comprising determining fi-om a sample derived from a subject an abnormally decreased or 
increased level of polypeptide or mRNA. This method of diagnosis is known as 
differential expression. The expression of a particular gene is compared between a 
diseased tissue and a normal tissue. A difference between the polynucleotide-related 
gene, mRNA, or protein in the two tissues is compared, for example in molecular weight, 

30 amino acid or nucleotide sequence, or relative abundance, indicates a change in the gene, 
or a gene which regulates it, in die tissue of the human that was suspected of being 
diseased. 
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Decreased or increased expression can be measured at the RNA level. PolyA RNA is 
first isolated from the two tissues and the detection of mRNA encoded by a gene 
corresponding to a differentially expressed polynucleotide of the invention can be 

5 detected by, for example, in situ hybridization in tissue sections, reverse trascriptase- 
PCR, using Northern blots containing poly A+ mRNA, or any other direct or inderect 
RNA detection method. An increased or decreased expression of a given RNA in a 
diseased tissue compared to a normal tissue suggests that the transcript and/or the expressed 
protein has a role in the disease. Thus detection of a higher or lower level of mRNA 

10 corresponding to SEQ ID NO 1 or 3 relative to nomial level is indicative of the presence 
of cancer in the patient. 

mRNA expression levels in a sample can be determined by generation of a library of 
expressed sequence tags (ESTs) from the sample. The relative representation of ESTs in 
15 the library can be used to assess the relative representation of the gene transcript in the 
starting sample. The EST analysis of the test can then be compared to the EST analysis 
of a reference sample to determine the relative expression levels of the polynucleotide of 
interest. 

20 Other mRNA analyses can be carried out using serial analysis of gene expression (SAGE) 
methodology (Velculescu et. Al. Science (1995) 270:484) , differential display 
methodology (For example, US 5,776,683) or hybridization analysis which relies on the 
specificity of nucleotide interactions. 

25 Altematively, the comparison could be made at the protein level. The protein sizes in the 
two tissues may be compared using antibodies to detect polypeptides in Westem blots of 
protein extracts from the two tissues. Expression levels and subcellular localization may 
also be detected immunologically using antibodies to the corresponding protein. Further 
assay techniques that can be used to determine levels of a protein, such as a polypeptide of 

30 the present invention, in a sample derived from a host are well-known to those of skill in the 
art. A raised or decreased level of polypeptide expression in the diseased tissue compared 
with the same protein expression level in the normal tissue indicates that the expressed 
protein may be involved in the disease. 
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In the assays of the present invention, the diagnosis can be determined by detection of gene 
product expression levels encoded by at least one sequence set forth in SEQ ID NOS: 1 or 3. 
A comparison of the mRNA or protein levels in a diseased versus normal tissue may also be 
5 used to follow the progression or remission of a disease. 

A large number of polynucleotide sequences in a sample can be assayed using 
polynucleotide arrays. These can be used to examine differential expression of genes and to 
determine gene function. For example, arrays of the polynucleotide sequences SEQ ID NO: 

10 1 or 3 can be used to determine if any of the polynucleotides are differentially expressed 
between a normal and cancer cell. In one embodiment of the invention, an array of 
oligonucleotides probes comprising the SEQ ID NO: 1 or 3 nucleotide sequence or 
fragments thereof can be constructed to conduct efficient screening of e.g., genetic 
mutations. Array technology methods are well known and have general appUcability and 

15 can be used to address a variety of questions in molecular genetics including gene 

expression, genetic linkage, and genetic variability (see for example: M.Chee et al., Science, 
Vol 274, pp 610-613 (1996)). 

"Diagnosis" as used herein includes determination of a subject's susceptibility to a 
20 disease, determination as to whether a subject presently has the disease, and also the 
prognosis of a subject affected by the disease. 

The present invention, further relates to a diagnostic kit for performing a diagnostic assay 
which comprises: 

25 (a) a polynucleotide of the present invention, preferably the nucleotide sequence of SEQ 
ID NO: 1 or 3, or a fragment thereof ; 

(b) a nucleotide sequence complementary to that of (a); 

(c) a polypeptide of the present invention, preferably the polypeptide of SEQ ED NO: 2 or 
4, or a fragment thereof; or 

30 (d) an antibody to a polypeptide of the present invention, preferably to the polypeptide of 
SEQIDN0:2 OR 4. 
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The nucleotide sequences of the present invention are also valuable for chromosomal 
localisation. The sequence is specifically targeted to, and can hybridize with, a particular 
location on an individual human chromosome. The mapping of relevant sequences to 
chromosomes according to the present invention is an important first step in correlating 

5 those sequences with gene associated disease. Once a sequence has been mapped to a 

precise chromosomal location, the physical position of the sequence on the chromosome can 
be correlated with genetic map data. Such data are found in, for example, V. McKusick, 
Mendelian Inheritance in Man (available on-line through Johns Hopkins University Welch 
Medical Library). The relationship between genes and diseases that have been mapped to 

10 the same chromosomal region are then identified through linkage analysis (coinheritance of 
physically adjacent genes).The differences in the cDNA or genomic sequence between 
affected and unaffected individuals can also be determined. 

The polypeptides of the invention or their Segments or analogs thereof, or cells expressing 
15 them, can also be used as immunogens to produce antibodies immunospecific for 

polypeptides of the present invention. The term "immunospecific" means that the antibodies 
have substantially greater affinity for the polypeptides of the invention than their affinity for 
other related polypeptides in the prior art. 

20 In a further aspect the invention provides an antibody immunospecific for a polypeptide 
according to the invention or an immunological fiagment thereof as hereinbefore defined. 
Preferably the antibody is a monoclonal antibody 

Antibodies generated against polypeptides of the present invention may be obtained by 
25 administering the polypeptides or epitope-bearing fi-agments, analogs or cells to an animal, 
preferably a non-human animal, using routine protocols. For preparation of monoclonal 
antibodies, any technique which provides antibodies produced by continuous cell line 
cultures can be used. Examples include the hybridoma technique (Kohler, G. and Milstein, 
C, Nature (1975) 256:495-497), the trioma technique, the human B-cell hybridoma 
30 technique (Kozbor et al. Immunology Today (1983) 4:72) and the EBV-hybridoma 
technique (Cole et al. Monoclonal Antibodies and Cancer Therapy, 77-96, Alan R. Liss, 
Inc., 1985). 
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Techniques for the production of single chain antibodies, such as those described in U.S. 
Patent No. 4,946,778, can also be adapted to produce single chain antibodies to polypeptides 
of this invention. Also, transgenic mice, or other organisms, including other mammals, may 
be used to express humanized antibodies. 

5 

The above-described antibodies may be employed to isolate or to identify clones expressing 
the polypeptide or to purify the polypeptides by affinity chromatography. 
The antibody of the invention may also be employed to prevent or treat cancer, particularly 
ovarian and colon cancer, autoimmune disease and related conditions. 

10 

Another aspect of tiie invention relates to a method for inducing or modulating an 
immunological response in a mammal which comprises inoculating the manmial with a 
polypeptide of the present invention, adequate to produce antibody and/or T cell inmiune 
response to protect or ameliorate the symptoms or progression of the disease. Yet 
1 5 another aspect of the invention relates to a metiiod of inducing or modulating 

immunological response in a mammal which comprises, delivering a polypeptide of the 
present invention via a vector directing expression of the polynucleotide and coding for 
the polypeptide in vivo in order to induce such an immunological response to produce 
antibody to protect said animal from diseases. 

20 

It will be appreciated that the present invention therefore provides a method of treating 
abnormal conditions such as, for instance, cancer and autoimmune diseases, in particular, 
ovarian and colon cancer, related to either a presence ot an excess of, or an under- 
expression of, CASB6411 polypeptide activity. 

25 

The present invention further provides for a method of screening compounds to identify 
those which stimulate or which inhibit the function of the CASB641 1 polypeptide. In 
general, agonists or antagonists may be employed for therapeutic and prophylactic purposes 
for such diseases as hereinbefore mentioned. Compounds may be identified from a variety 
30 of sources, for example, cells, cell-free preparations, chemical libraries, and natural product 
mixtures. Such agonists, antagonists or inhibitors so-identified may be natural or modified 
substrates, ligands, receptors, enzymes, etc., as the case may be, of the polypeptide; or may 
be structural or functional mimetics tfiereof (see Coligan et al.y Current Protocols in 
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Immunology l(2):Chapter 5 (1991)). Screening methods will be known to those skilled in 
the art. Further screening methods may be found in for example D. Bennett et ai, J Mol 
Recognition, 8:52-58 (1995); and K. Johanson et ai, J Biol Chem, 270(1 6):9459-9471 
(1995) and references therein. 

5 

Thus the invention provides a method for screening to identify compounds which stimulate 
or which inhibit the function of the polypeptide of the invention which comprises a method 
selected from the group consisting of: 

(a) measuring the binding of a candidate compound to the polypeptide (or to the cells or 
10 membranes bearing the polypeptide) or a fusion protein thereof by means of a label 

directly or indirectly associated with the candidate compound; 

(b) measuring the binding of a candidate compound to the polypeptide (or to the cells or 
membranes bearing the polypeptide) or a fusion protein thereof in the presense of a 
labeled competitior; 

15 (c) testing whether the candidate compound results in a signal generated by activation or 
inhibition of the polypeptide, using detection systems appropriate to the cells or cell 
membranes bearing the polypeptide; 

(d) mixing a candidate compound with a solution containing a polypeptide of claim 1, to 
form a mixture, measuring activity of the polypeptide in the mixture, and comparing the 

20 activity of the mixture to a standard; or 

(e) detecting the effect of a candidate compound on the production of mRNA encoding 
said polypeptide and said polypeptide in cells, using for instance, an ELIS A assay. 

The polypeptide of the invention may be used to identify membrane bound or soluble 
25 receptors, if any, through standard receptor binding techniques known in the art. Well 
known screening methods may also be used to identify agonists and antagonists of the 
polypeptide of the invention which compete with the binding of the polypeptide of the 
invention to its receptors, if any. 

30 Thus, in another aspect, the present invention relates to a screening kit for identifying 
agonists, antagonists, ligands, receptors, substrates, enzymes, etc. for polypeptides of the 
present invention; or compounds which decrease or enhance the production of such 
polypeptides, which comprises: 
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(a) a polypeptide of the present invention; 

(b) a recombinant cell expressing a polypeptide of the present invention; 

(c) a cell membrane expressing a polypeptide of the present invention; or 

(d) antibody to a polypeptide of the present invention; 

5 which polypeptide is preferably that of SEQ ID N0:2 OR 4. 

It will be readily appreciated by the skilled artisan that a polypeptide of the present 
invention may also be used in a method for the structure-based design of an agonist, 
antagonist or inhibitor of the polypeptide, by: 
10 (a) determining in the first instance the three-dimensional structure of the polypeptide; 

(b) deducing the three-dimensional structure for the likely reactive or binding site(s) of 
an agonist, antagonist or inhibitor; 

(c) synthesing candidate compounds that are predicted to bind to or react with the 
deduced binding or reactive site; and 

15 (d) testing whether the candidate compounds are indeed agonists, antagonists or 
inhibitors. 

Gene therapy may also be employed to effect the endogenous production of CASB641 1 
polypeptide by the relevant cells in the subject. For an overview of gene therapy, see 
20 Chapter 20, Gene Therapy and other Molecular Genetic-based Therapeutic Approaches, 
(and references cited therein) in Human Molecular Genetics, T Strachan and A P Read, 
BIOS Scientific Publishers Ltd (1996). 

Vaccine preparation is generally described in Pharmaceutical Biotechnology, Vol.61 
25 Vaccine Design - the subunit and adjuvant approach, edited by Powell and Newman, 
Plenum Press, 1995. New Trends and Developments in Vaccines, edited by Voller et al., 
University Park Press, Baltimore, Maryland, U.S.A. 1978. Encapsulation within 
liposomes is described, for example, by FuUerton, U.S. Patent 4,235,877. Conjugation of 
proteins to macromolecules is disclosed, for example, by Likhite, U.S. Patent 4,372,945 
30 and by Armor et al., U.S. Patent 4,474,757. 

The amount of protein in each vaccine dose is selected as an amount which induces an 
immunoprotective response without significant, adverse side effects in typical vaccinees. 
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Such amount will vary dq)ending upon which specific immunogen is employed. 
Generally, it is expected that each dose will comprise 1-lOOO^g of protein, preferably 
2-100|ag, most preferably 4-40ng. An optimal amount for a particular vaccine can be 
ascertained by standard studies involving observation of antibody titres and other 
5 responses in subjects. Following an initial vaccination, subjects may receive a boost in 
about 4 weeks. 

"Isolated" means altered "by the hand of man" fi-om the natural state. If an "isolated" 
composition or substance occurs in nature, it has been changed or removed from its 
10 original environment, or bodi. For example, a polynucleotide or a polypeptide naturally 
present in a Uving animal is not "isolated," but the same polynucleotide or polypeptide 
separated from the coexisting materials of its natural state is "isolated", as the term is 
employed herein. 

15 "Polynucleotide" generally refers to any polyribonucleotide or polydeoxribonucleotide, 
which may be unmodified RNA or DNA or modified RNA or DNA including single and 
double stranded regions. 

'^Variant" refers to a polynucleotide or polypeptide that differs from a reference 
20 polynucleotide or polypeptide, but retains essential properties. A typical variant of a 
polynucleotide differs in nucleotide sequence from another, reference polynucleotide. 
Changes in the nucleotide sequence of the variant may or may not alter the amino acid 
sequence of a polypeptide encoded by the reference polynucleotide. Nucleotide changes 
may result in amino acid substitutions, additions, deletions, fiisions and truncations in the 
25 polypeptide encoded by the reference sequence, as discussed below. A typical variant of 
a polypeptide differs in amino acid sequence from another, reference polypeptide. 
Generally, differences are limited so that the sequences of the reference polypeptide and 
the variant are closely similar overall and, in many regions, identical. A variant and 
reference polypeptide may differ in amino acid sequence by one or more substitutions, 
30 additions, deletions in any combination. A substituted or inserted amino acid residue 
may or may not be one encoded by the genetic code. A variant of a polynucleotide or 
polypeptide may be a naturally occurring such as an allelic variant, or it may be a variant 
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that is not known to occur naturally. Non-naturally occurring variants of polynucleotides 
and polypeptides may be made by mutagenesis techniques or by direct synthesis. 

"Identity," as known in the art, is a relationship between two or more polypeptide sequences 

5 or two or more polynucleotide sequences, as detemiined by comparing the sequences. In the 
art, "identity" also means the degree of sequence relatedness between polypeptide or 
polynucleotide sequences, as the case may be, as determined by the match between 
strings of such sequences. "Identity" and "similarity" can be readily calculated by known 
methods, including but not limited to those described in (Computational Molecular 

10 Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W., ed.. Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., 
Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, 
G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, 

15 J., eds., M Stockton Press, New York, 1991 ; and Carillo, H., and Lipman, D., SIAM J, 
Applied Math., 48: 1073 (1988). Preferred methods to determine identity are designed to 
give the largest match between the sequences tested. Methods to determine identity and 
similarity are codified in publicly available computer programs. Preferred computer 
program methods to determine identity and similarity between two sequences include, but 

20 are not limited to, the GCG program package (Devereux, J., et al.. Nucleic Acids 

Research 12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Atschul, S.F. et al., J. 
Molec. BioL 215: 403-410 (1990). The BLAST X program is publicly available from 
NCBI and other sources (BLAST Manual. Altschul, S., et al, NCBI NLM NIH Bethesda, 
MD 20894; Altschul, S., et ai, J. Mol. BioL 215: 403-410 (1990). The well known Smith 

25 Waterman algorithm may also be used to determine identity. 

The preferred algorithm used is FASTA. The preferred parameters for polypeptide or 
polynuleotide sequence comparison using this algorithm include the following: 
Gap Penalty: 12 
30 Gap extension penalty: 4 
Word size: 2, max 6 
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Preferred parameters for polypeptide sequence comparison with other methods include 
the following: 

1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
Comparison matrix: BLOSSUM62 Ifrom Hentikoff and Hentikoff, Proc. Natl. Acad. Sci, 
5 USA. 89:10915-10919 (1992) 
Gap Penalty: 12 
Gap Length Penalty: 4 

A program useful with these parameters is publicly available as the "gap" program from 
10 Genetics Computer Group, Madison WI. The aforementioned parameters are the default 
parameters for polypeptide comparisons (along with no penalty for end gaps). 

Prefenred parameters for polynucleotide comparison include the following: 
1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 
15 Comparison matrix: matches = +10, mismatch = 0 
Gap Penalty: 50 
Gap Length Penalty: 3 

A program usefiil with these parameters is publicly available as the "gap" program from 
20 Genetics Computer Group. Madison WI. The aforementioned parameters are the default 
parameters for polynucleotide comparisons. 

By way of example, a polynucleotide sequence of the present invention may be identical 
to the reference sequence of SEQ E) N0:1, that is be 100% identical, or it may include 

25 up to a certain integer nimiber of nucleotide alterations as compared to the reference 
sequence. Such alterations are selected from the group consisting of at least one 
nucleotide deletion, substitution, including transition and transversion, or insertion, and 
wherein said alterations may occur at the 5' or 3' terminal positions of the reference 
nucleotide sequence or anywhere between those terminal positions, interspersed either 

30 individually among the nucleotides in the reference sequence or in one or more 

contiguous groups within the reference sequence. The number of nucleotide alterations is 
determined by multiplying the total number of nucleotides in SEQ ID NO: 1 by the 
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numerical percent of the respective percent identity(divided by 100) and subtracting that 
product from said total number of nucleotides in SEQ ID N0:1, or: 

nn^Xn-(Xn»y), 

wherein is the number of nucleotide alterations, is the total number of nucleotides 
5 in SEQ ID NO: 1, and y is, for instance, 0.70 for 70%, 0.80 for 80%, 0.85 for 85%, 0.90 
for 90%, 0.95 for 95%,etc., and wherein any non-integer product of and y is rounded 
down to the nearest integer prior to subtracting it from x^. Alterations of a 
polynucleotide sequence encoding the polypeptide of SEQ ID N0:2 OR 4 may create 
nonsense, missense or frameshift mutations in this coding sequence and thereby alter the 
10 polypeptide encoded by the polynucleotide following such alterations. 

Similarly, a polypeptide sequence of the present invention may be identical to the 
reference sequence of SEQ ED N0:2 OR 4, that is be 100% identical, or it may include up 
to a certain integer number of amino acid alterations as compared to the reference 

15 sequence such that the % identity is less than 100%. Such alterations are selected from 
the group consisting of at least one amino acid deletion, substitution, including 
conservative and non-conservative substitution, or insertion, and wherein said alterations 
may occur at the amino- or carboxy-terminal positions of the reference polypeptide 
sequence or anywhere between those terminal positions, interspersed either individually 

20 among the amino acids in the reference sequence or in one or more contiguous groups 
within the reference sequence. The number of amino acid alterations for a given % 
identity is determined by multiplying the total number of amino acids in SEQ ID N0:2 
OR 4 by the numerical percent of the respective percent identity(divided by 100) and then 
subtracting that product from said total number of amino acids m SEQ ID N0:2 OR 4, or: 

25 na^a-(Xa»y). 

wherein is the number of amino acid alterations, x^ is the total number of amino acids 
in SEQ ID N0:2 OR 4, and y is, for instance 0.70 for 70%, 0.80 for 80%, 0.85 for 85% 
etc., and wherein any non-integer product of x^ and y is rounded down to the nearest 

integer prior to subtracting it from x^. 
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*'HomoIog" is a generic terni used in the art to indicate a polynucleotide or polypeptide 
sequence possessing a high degree of sequence relatedness to a subject sequence. Such 
relatedness may be quntified by determining the degree of identity and/or similarity between 
the sequences being compared as hereinbefore described. Falling within this generic term 
5 are the terms "ortholog", meaning a polynucleotide or polypeptide that is the functional 
equivalent of a polynucleotide or polypeptide in another species and "paralog" meaning a 
functionally similar sequence when considered within the same species. 

10 Examples 

Example 1 

Subtractive cDNA cloning of colon tumour-associated antigen (TAA) candidates. 

Subtractive cDNA libraries are produced using standard technologies. Briefly, total 

15 RNA is extracted from frozen (-70*^0) tumour and matched normal colon samples using the 
TriPure reagent and protocol (Boehringer). Target RNA is prepared by pooling total RNA 
from three tumour samples (30 ng each). Driver RNA is prepared by pooling total RNA 
from three matched normal colon samples (10 ng each) and total RNA from seven normal 
tissues other than colon(brain, heart, kidney, Uver, bladder, skin, spleen; 10 ^g each). Total 

20 RNA from non-colon normal tissues is purchased from InVitrogen. 

Messenger RNA is purified from total RNA using oUgo-dT magnetic bead 
technology (Dynal) and quantified by spectrofluorimetry (BioRad). 

Target and driver mRNA are reverse transcribed into cDNA using one of two 
strategies: 1) Target sequences for PGR oligonucleotides are introduced onto the ends of the 

25 newly synthesised cDNA during reverse transcription using the template switching 
capability of reverse transcriptase (ClonTech SMART PGR cDNA synthesis kit). 2) 
Alternatively, the target and driver mRNA are reverse transcribed into cDNA using an 
oligo-dT primer and converted to double-strand cDNA; the cDNA is cleaved with Rsal and 
linkers for PGR amplification are ligated onto the extremities of the cDNA Segments. 

30 In both cases, target and driver cDNA are amplified by long range PGR (ClonTech 

SMART PGR Synthesis Kit and Advantage PGR Polymerase Mix) and used as starting 
material for subtractive cloning. For amplification, cycling conditions and optimisation of 
the number of PGR cycles are as described in the Advantage PGR protocol 
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Two subtractive cloning strategies are used: ClonTech ?CR SELECT (see 
ClonTech kit protocol and N. Gurskaya et al 1996. Analytical Biochemistry: 240, 90) and 
cRDA(M. Hubank and D. Schatz. 1994. Nucleic Acids Research: 22. 5640) . When the PCR 
SELECT protocol is used, the primary PCR SELECT subtraction products are submitted to 
5 a supplementary round of cRDA subtraction. When the cRDA protocol is used, two 
consecutive cycles of cRDA subtraction are performed. In each case the products of both 
cycles of subtraction are cloned into pCR-TOPO (InVitrogen) and transformed into E. coli 
to produce a subtracted cDNA plasmid library. 

10 An alternative strategy is also followed: subtraction of nomial colon sequences and 

sequences from non-colon normal tissues are subtracted in separate hybridizations. In this 
case, target and driver RNA are assembled for the first subtraction as above with the 
exception that non-colon RNA is left out of the driver pool and amounts of normal colon are 
increased to 10 ^g. Preparation of target and driver cDNA and subtractive hybridization are 

15 perfomied as described above. A second subtraction is then perforaied on the products of die 
first subtraction, but the driver is now composed of a pool of normal colon and normal non- 
colon mRNA from the seven noraial tissues. 

Example 2 

20 Differential Screening of cDNA arrays. 

Identification of tumour-associated genes in the subtracted cDNA library is accomplished by 
differential screening. 

Total bacterial DNA is extracted from 100 ^1 over-night cultures. Bacteria are lysed with 
25 guanidium isothiocyantate and the bacterial DNA is affinity purified using magnetic glass 
(Boehringer). Plasmid inserts are recovered from the bacterial DNA by Advantage PCR 
amplification (Clontech). The PCR products are dotted onto two nylon membranes to 
produce high density cDNA arrays using the Biomek 96 HDRT tool (Beekman). The 
spotted cDNA is covalentiy linked to the membrane by UV irradiation. The first membrane 
30 is hybridised with a mixed cDNA probe prepared from the tumour of a single patient. The 
second membrane is hybridised with an equivalent amount of mixed cDNA probe prepared 
from nomial colon of the same patient. The probe cDNA is prepared by PCR amplification 
as described above and is labelled using the AlkPhos Direct System (Amersham). 
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Hybridisation conditions and stringency washes are as described in the AlkPhos Direct kit. 
Hybridized probe is detected by chemiluminescence. Hybridisation intensities for each 
cDNA fragment on both blots are measured by fihn densitometry or direct measurement 
(BioRad Fluor-S Max). The ratio of the tumour to normal hybridisation intensities (T/N) is 
5 calculated for each gene to evaluate the degree of over-expression in the tumour. Genes 
which are significantly over-expressed in colon tumours are foUowed-up, Significance is 
arbitrarily defined as one standard deviation of the T/N fi-equency distribution. Differential 
screening experiments are repeated using RNA from multiple patient donors (>18) to 
estimate the frequency of over-expressing tumours in the patient population. 
10 In addition, the DNA airays are hybridised with mixed cDNA probes from normal tissues 
other than colon (see list above) to determine the level of expression of the candidate gene in 
these tissues. 

Example 3 
15 Real-time RT-PCR analysis 

Real-time RT-PCR (U. Gibson. 1996. Genome Research: 6,996) is used to compare 
mRNA transcript abundance of the candidate antigen in matched tumour and normal 
colon tissues from multiple patients. In addition, mRNA levels of the candidate gene in a 
panel of normal tissues are also evaluated by this approach. 

20 

Total RNA from normal and tumour colon is extracted from snap frozen biopsies using 
TriPure reagent (Boehringer). Total RNA from normal tissues is purchased from 
InVitrogen or is extracted from snap frozen biopsies using TriPure reagent. Poly-A+ 
mRNA is purified from total RNA after DNAase treatment using oligo-dT magnetic 
25 beads (Dynal). Quantification of the mRNA is performed by spectrofluorimetry 

(VersaFluor, BioRad) using Sybrll dye (Molecular Probes). Primers for real-time PCR 
amplification are designed with the Perkin-Elmer Primer Express software using default 
options for TaqMan amplification conditions. 

30 Real-time reactions are assembled according to standard PCR protocols using 2 ng of 
purified mRNA for each reaction. SybrI dye (Molecular Probes) is added at a final 
dilution of 1/75000 for real-time detection. Amplification (40 cycles) and real-time 
detection is performed in a Perkin-Elmer Biosystems PE7700 system using conventional 
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instrument settings. Ct values are calculated using the PE7700 Sequence Detector 
software. Several Ct values are obtained for each samples : for the patient samples, the 
tumour Ct (CtT) and the matched normal colon Ct (CtN) values on the candidate TAA, 
and for the panel of normal tissue samples, a CtXY for each normal tissue XY. An 
5 another Ct (CtA) is also calculated on Actin gene, as an internal reference, for all of the 
samples. 

As the eflRciency of PCR amplification under the prevailing experimental conditions is 
close to the theoretical amplification efficiency, 2^^^^^^*^^^ value is an estimate of the 
10 relative TAA transcript level of the sample, standardised with respect to Actin transcript 
level. A value of 1 thus suggests the candidate antigen and Actin have the same 
expression level. 

Real-time PCR reactions were performed on tumour colon and adjacent normal colon 
15 fi*om biopsies of 18 patients. Results are shown in figure 1 . 36 normal tissue samples, 
representing 28 different tissues (see table 2), were also tested by the same procedure. 
Results are shown in figure 2. 

TAA transcript levels are calculated as described above. The proportion of patients over- 
20 expressing the candidate antigen, as well as the average transcript over-expression versus 
normal tissues is also calculated firom this data set. 
Overall results are shown in Table 1 : 

Table 1 : CASB6411 Real-time PCR expression results 

25 



% of patients with a CASB6411 transcript level higher in tumotxr colon than 
adjacent nonnal colon (positive patients) 


83% 


% of positive patients with a CASB6411 transcript level at least 3 fold higher 
CASB641 1 transcript than adjacent normal colon 


60% 


% of positive patients with a CASB641 1 transcript level at least 10 fold higher in 
tumour colon than adjacent normal colon 


40% 


Average transcript over-expression fold in positive patients 


40 


% of patients with a CASB6411 transcript level higher in tumour colon than 
nonnal tissue median 


100% 


% of patients with a mRNA level at least 3 fold higher in tumour colon than 
normal tissue median 


100% 



49 



wo 01/23417 



PCT/EPOO/09500 



% of patients with a mRNA level at least 10 fold higher in tumour colon than 
norma! tissue median 


40% 


Normal tissues where CASB6411 transcript expression is equivalent than tumour 
transcript level in tumours 


Lung, 
prostate 



Table 1 clearly suggest CASB641 1 transcript is over-expressed in colorectal tumours 
compared to adjacent normal colon and to most of the above mentioned normal tissues. 
5 More than 80% of the patients strongly over-express CASB641 1 transcript in tumour, as 
compared to adjacent normal colon. Average over-expression fold is at least of 40, with 
40 % of patients having overexpressing at least 10 fold. More over, all of the patients 
over-express CASB641 1 transcript in colorectal tumors, as compared to other normal 
tissues, 40 % of them overexpressing it at least 10 fold. 

10 

Table 2 : listing of aormal tissues used for CASB7439 transcript expression analysis. 



Tissue 


Abbreviation 


Categorie 


Adrenal gland 


Ad_GI 


non dispensable 


Aorta 


Ao 


non dispensable 


Bladder 


81 


non dispensable 


Bone marrow 


Bo_Ma 


non dispensable 


Brain 


Bra 


non dispensable 


Cervix 


Ce 


non dispensable 


Colon 


Co 


non dispensable 


Fallopian tube 


Fa_Tu 


non dispensable 


Heart 


He 


non dispensable 


lleon 


II 


non dispensable 


Kidney 


Ki 


non dispensable 


Liver 


Li 


non dispensable 


Lung 


Lu 


non dispensable 


Lymph node 


Ly_No 


non dispensable 


Oesophagus 


Oe 


non dispensable 


Parathyroid gland 


Pa_Thy 


non dispensable 


Rectum 


Re 


non dispensable 


Skin 


Sk 


non dispensable 


Skeletal muscle 


Sk Mu 


non dispensable 


Small intestine 


Sm In 


non dispensable 


Spleen 


Sp 


non dispensable 


Stomach 


St 


non dispensable 


Thyroid gland 


Thy 


non dispensable 


Trachea 


Tra 


non dispensable 


Ovary 


Ov 


dispensable 


Placenta 


PI 


dispensable 


Prostate 


Pr 


dispensable 


Testis 


Te 


dispensable 
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Example 4 

DNA microarrays 

DNA micro-arrays are used to examine mRNA expression profiles of large collections of 
genes in multiple samples. This information is used to complement the data obtained by 
5 real-time PGR and provides an independent measure of gene expression levels in tumors 
and normal tissues. 

Examples of current technologies for production of DNA micro-arrays include 1) The 
Affymetrix "GeneChip" arrays in which oligonucleotides are synthetized on the surface 

10 of the chip by solid phase chemical synthesis using a photolithographic process 2) DNA 
spotting technology in which small volumes of a DNA solution are robotically deposited 
and then immobilized onto the surface of a solid phase (e.g. glass). In both instances, the 
chips are hybridized with cDNA or cRNA which has been extracted from the tissue of 
interest (e.g. normal tissue, tumour etc.. .) and labeled with radioactivity or with a 

15 fluorescent reporter molecule. The labeled material is hybridized to the chip and the 
amoimt of probe boimd to each sequence on the chip is determined using a speciaUzed 
scanner. The experiment can be set-up with a single fluorescent reporter (or radioactivity) 
or, alternatively, can be performed using two fluorescent reporters. In this latter case, 
each of the two samples is labeled with one of the reporter molecules. The two labeled 

20 samples are then hybridized competitively to the sequences on the DNA chip. The ratio 
of the two fluorescent signals is determined for each sequence on the chip. This ratio is 
used to calculate the relative abundance of the transcript in the two samples. Detailed 
protocols are available from a nimiber of sources mcluding "DNA Microarrays: A 
practical approach. Schena M. Oxford University Press 1999" and the World Wide Web 

25 (http://cmgm.stanford.eduypbrown/protocols/index.html), http://arTayit.com/DNA- 
Microarray-Protocols/) and specialized distributors (e.g. Affymetrix). 



Example 5 
30 Northern-Southern blot analysis 

Limited amounts of mixed tumour and matched normal colon cDNA are amplified by 
Advantage PGR (see above). Messenger RNA from multiple normal tissues is also 
amplified using the same procedure. The amplified cDNA (1 ng) is electrophoresed on a 
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1.2% agarose gel and transferred onto a nylon membrane. The membrane is hybridised 
(AlkPhos Direct System) with a probe prepared using a fragment of the candidate TAA 
cDNA. Northern-Southern analysis provides information on transcript size, presence of 
splice variants and transcript abundance in tumour and normal tissues. 

5 

Example 6 

Northern Blot Analysis 

Northern blots are produced according to standard protocols using 1 ^g of poly A+ 
mRNA. Radioactive probes are prepared using the Ready-to-Go system (Pharmacia). 

10 

Example 7 

In silico detection of the full length cDNA sequence 

EST sequence databases are screened with experimentally obtained cDNA sequence 
fragments, using the Blast algorithm (Altschul, S.F., Gish, W., Miller, W., Myers, E.W. 

15 & Lipman, D.J. (1990) "Basic local ahgnment search tool." J. Mol. Biol. 215:403-41). 
The aim is to search for overlapping or longer identical EST sequences. Matched EST 
sequences are then assembled together, using the SeqMan software of the Lasergene 
package (DNASTAR). The consensus sequence of the resulting assembly is an EST- 
derived longer cDNA. This EST-derived cDNA is further analysed using the GeneMark 

20 software to find a potential open reading frame (ORF). The translated sequence of the 
ORE is compared with protein databases, using the Blast algorithm, to find homologues. 
If any, the homologous protein sequences are further used to complete the cDNA 
prediction by searching for genomic contig homologies using the Wise2 algorithm, 
leading to a genome-derived, virtual cDNA sequence. This virtual cDNA in finally 

25 assembled with EST-derived cDNA, and the new consensus cDNA undergoes a final 
check against ESTs to confirm the Wise2 prediction, and correct potential sequencing 
errors and frameshifts. The virtual cDNA is considered as a virtual full length cDNA 
once a full ORF (from start to stop codons), with clear protein homologies and coding 
potential. 

30 The SEQ ID NO: 1 has been obtained using this in silico cloning method of the full length 
cDNA sequence of CASB6411, which has a putative open reading frame of 460 amino 
acids (SEQ ID N0:2), and a potential isoform generated by alternative splicing (SEQ ID 
N0:3), encoding a truncated protein (SEQ ID N0:4). 
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Virtual full-length cDNA sequence is experimentally checked as described below. 
Example 8 

Experimental Identification of the full length cDNA sequence 

5 Colon tumour cDNA libraries are constructed using the Lambda Zap n system 

(Stratagene) from 5 jig of polyA+ mRNA. The supplied protocol is followed except that 
Superscript!! (Life Technologies) is used for the reverse transcription step. OUgo dT- 
primed and random-primed libraries are constructed. About 1.5 xl06 independent phages 
are plated for each screening of the library. Phage plaques are transferred onto nylon 

10 filters and hybridised using a cDNA probe labelled with AlkPhos Direct. Positive phages 
are detected by chemiluminescence. Positive phage are excised from the agar plat, eluted 
in 500^1 SM buffer and confirmed by gene-specific PGR. Eluted phages are converted to 
single strand Ml 3 bacteriophage by in vivo excision. The bacteriophage is then converted 
to double strand plasmid DNA by infection of E. coli. Lifected bacteria are plated and 

15 submitted to a second round of screening with the cDNA probe. Plasmid DNA is purified 
from positive bacterial clones and sequenced on both strands. 

When the full length gene cannot be obtained directly from the cDNA library, missing 
sequence is isolated using RACE technology (Marathon Kit, ClonTech.). This approach 
20 relies on reverse transcribing mRNA into double strand cDNA, Ugating hnkers onto the 
ends of the cDNA and amplifying the desired extremity of the cDNA using a gene- 
specific primer and one of the linker oUgonucleotides. Marathon PCR products are cloned 
into a plasmid (pCRI!-TOPO, InVitrogen) and sequenced. 

25 Example 9. 
EST profiles 

A complementary approach to experimental antigen tissue expression characterization is 
to explore the human EST database. ESTs ('Expressed Sequence Tags ) are small 
fragments of cDNA made from a collection of mRNA extracted from a particular tissue 
30 or cell line. Such database currently provides a massive amount of human ESTs (10^) 
from several hundreds of cDNA tissue libraries, including tumoral tissues from various 
types and states of disease. By means of informatics tools (Blast), a comparison search of 
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the CASB641 1 sequence is performed in order to have further insight into tissue 
expression. 



EST distribution of CASB6411 : 



EST GenBank Accession number 


EST cDNA tissue library 


C00562 


Human adult (K.Okubo) 


C05837 


Human pancreatic islet 


AA1 72076 


Stratagene ovarian cancer (#937219) 


AA1 72244 


Stratagene ovarian cancer (#937219) 


AA612697 


NCI_CGAP_Co10 


AA371314 


Prostate gland 1 


AI264367 


NCI_CGAP„Co8 


AI270207 


NCI CGAP Co14 


AI278830 


NCI_CGAP_Co8 


AI281230 


NCI CGAP Co8 


AI283827 


NCI CGAP Co8 


AI285194 


NCI_CGAP_Co8 


AI285227 


NCI_CGAP_Co8 


AI346622 


NCI CGAP Co8 


AI362363 


NCI CGAP Gas4 


AI473464 


NCI CGAP Gas4 


AI697014 


NCI CGAP Pan1 


A1799626 


NCI CGAP Gas4 


Ai830044 


NCI CGAP Lu19 


Ai921465 


NCI CGAP Gas4 


AI925050 


NCLCGAP_Gas4 


AW029127 


NCI CGAP Gas4 


AW365013 


DT0057 


AW452356 


NCI CGAP Sub5 


AW469177 


NCI CGAP Gas4 


AW469181 


NCI CGAP Gas4 


AW582253 


ST0212 


AW810203 


ST0125 


AW810268 


ST0125 


AW810418 


ST0125 


AW814058 


ST0198 


AW869793 


SN0075 



SEQ ID N0:1 perfectly aligns with 32 ESTs : 13 are from 2 stomach cDNA libraries, 9 
are from 3 tumor colon libraries, 2 are from one tumor ovary library, one is from one 
tumor pancreas library, one is from one lung tumor library, one is from one mixed tumor 
10 library, one is from one normal prostate Ubrary, one is from one normal stomach library, 
one is from normal pancreas library, 2 are from libraries of unknown type. 
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This clearly suggests, as expected, that CASB641 1 is over-expressed in tumor tissues, 
with an emphasis in colorectal and stomach tumor tissues, as compared to normal 
tissues. 

5 Example 10 : 

10*1 Expression and purification of tumour-specific antigens 

Expression in microbial hosts, or alternatively in vitro transcription/translation, is used to 
produce the antigen of the invention for vaccine purposes and to produce protein 
10 fragments or whole protein for rapid purification and generation of antibodies needed for 
characterization of the naturally expressed protein by immunohistochemistry or for 
follow-up of purification. 

Recombinant proteins may be expressed in two microbial hosts, E. coli and in yeast 
(such as Saccharomyces cerevisiae or Pichia pastoris). This allows the selection of the 
15 expression system with the best features for this particular antigen production. In 
general, the recombinant antigen will be expressed in E. coli and the reagent protein 
expressed in yeast. 

The expression strategy first involves the design of the primary structure of the . 
recombinant antigen. In general an expression fiision partner (EFP) is placed at the N 
20 terminal extremity to improve levels of expression that could also include a region usefiil 
for modulating the immunogenic properties of the antigen, an immune fiision partner 
(IFP). In addition, an affinity fusion partner (AFP) usefiil for facilitating fiulher 
purification is included at the C-terminal end. 

When the recombinant strains are available, the recombinant product is characterized by 
25 the evaluation of the level of expression and the prediction of fiirther solubility of the 
protein by analysis of the behavior in the crude extract. 

After growth on appropriate culture medium and induction of the recombinant protein 
expression, total extracts are analyzed by SDS-PAGE. The recombinant proteins are 
visualized in stained gels and identified by Westem blot analysis using specific 
30 antibodies. 

A comparative evaluation of the different versions of the expressed antigen will allow the 
selection of the most promising candidate that is to be used for fiirther purification and 
immunological evaluation. 
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The purification scheme follows a classical approach based on the presence of an His 
affinity tail in the recombinant protein. In a typical experiment the disrupted cells are 
filtered and the acellular extracts loaded onto an Ion Metal Affinity Chromatography 
5 (IMAC; Ni-H-NTA fi-om Qiagen) that will specifically retain the recombinant protein. 
The retained proteins are eluted by 0-500 mM Imidazole gradient (possibly in presence of 
a detergent) in a phosphate buffer. 

10.2 Antibody production and immunohistochemistry 

10 Small amounts of relatively purified protein can be used to generate immunological 
tools in order to 

a) detect the expression by immunohistochemistry in normal or cancer tissue sections; 

b) detect the expression, and to follow the protein during the purification process 
(ELISA/ Western Blot); or 

15 c) characterise/ quantify the purified protein (ELISA). 

10.2.1 Polyclonal antibodies: 
Immunization 

Rabbits are immunised , intramuscularly (I.M.) , 3 times at 3 weeks intervals with 
20 lOOng of protein, formulated in the adjuvant 3D-MPL/QS21 , Three weeks after each 
immunisation a blood sample is taken and the antibody titer estimated in the serum by 
ELISA using the protein as coating antigen following a standard protocol. 

ELISA 

25 96 well microplates (maxisorb Nunc) are coated with 5|ig of protein overnight at 4°C. 
After Ihour saturation at 37°C with PBS NCS 1%, serial dilution of the rabbit sera is 
added for IH 30 at 37*^0 (starting at 1/10). After 3 washings in PBS Tween. anti rabbit 
biotinylated anti serum (Amersham ) is added (1/5000). Plates are washed and 
peroxydase coupled streptavidin (1/5000) is added for 30 min at 37X. After washing, 

30 50^1 TMB (BioRad) is added for 7 min and the reaction then stopped with H2S04 0.2M. 
The OD can be measured at 450 nm and midpoint dilutions calculated by SoftmaxPro. 

10.2.2 Monoclonal antibodies : 
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Immunization 

5 B ALB/c mice are immunized 3 times at 3 week intervals with 5 fig of purified protein. 
Bleedings are performed 14 days post n and 1 week post 3. The sera are tested by Elisa 
on purified protein used as coated antigen. Based on these results (midpoint dilution > 
5 10000) one mouse is selected for fusion 

Fusion/ HATselection 

Spleen cells are fused with the SP2/0 myeloma according to a standard protocol using 
PEG 40% and DMSO 5%. Cells are then seeded in 96 well plates 2.5 xl04 - 105 
10 cells/well and resistant clones will be selected in HAT medium. The supernatant of these 
hybridomas will be tested for their content in specific antibodies and when positive, 
will be submitted to 2 cycles of limited dilution . After 2 rounds of screening, 3 
hybridomas will be chosen for ascitis production. 

15 10.2.3 Immunohistochemistry 

When antibodies are available, immuno staining is performed on normal or cancer tissue 
sections, in order to detemaine : 

0 the level of expression of the antigen of the invention in cancer relative to nonnal 
tissue or 

20 0 the proportion of cancer of a certain type expressing the antigen 
0 if other cancer types also express the antigen 
0 the proportion of cells expressing the antigen in a cancer tissue 

25 Tissue sample preparation 

After dissection, the tissue sample is moxmted on a cork disk in OCT compound and 
rapidly firozen in isopentane previously super cooled in liquid nitrogen (-160°C). The 
block will then be conserved at -70°C until use. 7-lOfim sections will be realised in a 
cryostat chamber (-20, -30°C). 

30 

Staining 

Tissue sections are dried for 5 min at room Temperature (RT), fixed in acetone for 

lOmin at RT, dried again, and saturated v^th PBS 0.5% BSA 5% serum. After 30 min at 
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RT either a direct or indirect staining is performed using antigen specific antibodies. A 
direct staining leads to a better specificity but a less intense staining whilst an indkect 
staining leads to a more intense but less specific staining. 

5 10 J Analysis of human cellular immune responses to the antigen of the invention 

The tmmimological relevance of the antigen of the invention can be assessed by in vitro 
priming of human T cells. All T cell lymphocyte lines and dendritic cells are derived 
from PBMCs (peripheral blood mononuclear cells) of healthy donors (preferred HLA-A2 
10 subtype). An HLA-A2.1/Kb transgenic mouse model is also used for screening of HLA- 
A2.1 peptides. 

Newly discovered antigen-specific CD8+ T cell lines are raised and maintained by 
weekly in vitro stimulation. The lytic activity and the Q-IFN production of the CDS lines 
15 in response to the antigen or antigen derived-pep tides is tested using standard procedures. 

Two strategies to raise the CD8+ T cell lines are used: a peptide-based approach and a 
whole gene-based approach. Both approaches require the fiill-length cDNA of the newly 
discovered antigen in the correct reading firame to be either cloned in an appropriate 
20 delivery system or to be used to predict the sequence of HLA binding peptides. 

Peptide-based approach 

Briefly, transgenic mice are immunized with adjuvanted HLA-A2 peptides, those unable 
to induce a CDS response (as defined by an efficient lysis of peptide-pulsed autologous 

25 spleen cells) will be further analyzed in the human system. 

Human dendritic cells (cultured according to Romani et al.) will be pulsed with peptides 
and used to stimulate CD8-sorted T cells (by Facs). After several weekly stimulations, the 
CDS lines will be first tested on peptide-pulsed autologous BLCL (EBV-B transformed 
cell lines). To verify the proper in vivo processing of the peptide, the CDS Unes will be 

30 tested on cDNA-transfected tumour cells (HLA-A2 transfected LnCaP, Skov3 or CAMA 
tumour cells). 



Whole gene-based approach 
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CD8+ T cell lines will be primed and stimulated with either gene-gun transfected 
dendritic cells, retrovirally transduced B7.1 -transfected fibroblasts, recombinant pox 
virus (Kim et al.) or adenovirus (Butterfield et al.) infected dendritic cells. Virus infected 
cells are very efficient to present antigenic peptides since the antigen is expressed at high 
5 level but can only be used once to avoid the over-growth of viral T cells lines. 

After alternated stimulations, the CD8+ lines are tested on cDNA-transfected tumour 
cells as indicated above. Peptide specificity and identity is determined to confirm the 
immunological validation. 

10 

CD4+ T-cell response 

Similarly, the CD4+ T-cell immune response can also be assessed. Generation of specific 
CD4+ T-cells is made using dendritic cells loaded with recombinant purified protein or 
15 peptides to stimulate the T-cells. 

Predicted epitopes (nonamers and decamers) binding HLA alleles : 

The HLA Class I binding peptide sequences are predicted either by the Parker's 

algorithm (Parker, K. C, M. A. Bednarek, and J. E, Coligan. 1994. Scheme for ranking 

20 potential HLA-A2 binding peptides based on independent binding of individual peptide 
side-chains. J. Immunol. 152:163 and http://bimas.dcrt.nih.gov/molbio/hla_bind/) or the 
Rammensee method (Rammensee, Friede, Stevanovic, MHC ligands and peptide motifs: 
1st listing, Immunogenetics 41, 178-228, 1995 ; Rammensee, Bachmann, Stevanovic: 
MHC ligands and peptide motifs. Landes Bioscience 1997, and 

25 http://134.2.96.221/scripts/hlaserver.dll/home.htm). Peptides are then screened in the 
HLA-A2.1/Kb transgenic mice model (Vitiello et aL). 

The HLA Class n binding peptide sequences are predicted using the Tepitope algorithm, 
with a score cut-off set to 6 (Stumiolo, Hammer at al., Nature Biotechnology. 1999. 
17;555-561). 

30 The following tables gather the Class I and n predicted epitope sequences : 



HLA-Al : decamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQID: 


1 


379 


ITEGRKIMIR 


112.500 


SEQ ID N0:9 
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HLA-A0201 : nonamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ED ; 


1 


191 


LLMDFVFSL 


27926.980 


SEQ ID NO: 10 


2 


292 


FLLFFPSFT 


2650.811 


SEQ ID NO: 11 


3 


285 


QMMTFFIFL 


2266.849 


SEQ ID NO: 12 


4 


286 


MMTFFIFLL 


1329.564 


SEQ ID NO: 13 


5 


403 


FLIEKLIKL 


926.658 


SEQ ID NO: 14 


6 


147 


VLLIRt^IFL 


739.032 


SEQ ID NO: 15 


7 


240 


LVWIGIFFC 


450.023 


SEQ ID NO: 16 


8 


305 


TLAITIWRL 


368.501 


SEQ ID NO: 17 


9 


22 


LIFCWDFTV 


348.892 


SEQ ID NO: 18 


10 


203 


FLGEFLRRI 


343.941 


SEQ ID NO: 19 


11 


112 


LLLPFVVSC 


273.114 


SEQ ID NO:20 


12 


66 


LLTRFSAYM 


176.513 


SEQ ID N0:21 


13 


21 


KLIFCWDFT 


119.495 


SEQ ID NO:22 




HLA-A0201 : decamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ID: 


1 


190 


LLLMDFVFSL 


6811.458 


SEQ ID NO:23 


2 


191 


LLMDFVFSLV 


3407.985 


SEQ ID NO:24 


3 


285 


QMMTFFIFLL 


2893.757 


SEQ ID NO:25 


4 


93 


YLAEYNLEFL 


2497.344 


SEQ ID NO:26 


5 


21 


KLIFCWDFTV 


1411.906 


SEQ ID NO:27 


6 


293 


LLFFPSFTGV 


831.216 


SEQ ID NO:28 


7 


65 


QLLTRFSAYM 


384.175 


SEQIDNO:29 


8 


239 


TLVWIGIFFC 


364.502 


SEQ ID NO:30 


9 


284 


SQMMTFFIFL 


318.231 


SEQ ID N0:31 


10 


111 


VLLLPFWSC 


273.114 


SEQIDN0:32 


11 


162 


ILCYYWLNTV 


271.948 


SEQ ID N0:33 


12 


366 


TLIVLIITYL 


270.234 


SEQIDNO:34 


13 


146 


YVLLIRNIFL 


174.977 


SEQIDNO:35 


14 


195 


FVFSLVNSFL 


174.977 


SEQIDNO:36 


15 


112 


LLLPFVVSC I 


150.931 


SEQIDNO:37 


16 


355 


LIGSVHFFFI 


147.808 


SEQIDNO:38 


17 


138 


YEMPRHEVYV 


126.598 


SEQ ID NO:39 


18 


66 


LLTRFSAYMV 


118.238 


SEQ ID NO:40 



HLA A205 : nonamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ID : 


1 


191 


LLMDFVFSL 


514.080 


SEQ ID NO: 10 


2 


403 


. FLIEKLIKL 


126.000 


SEQ ID NO: 14 
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5 



HLA A205 : decamers 




Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ID 


1 


190 


LLLMDFVFSL 


171.360 


SEQ ID NO:23 


2 


146 


YVLLIRNIFL 


126.000 


SEQ ID NO:35 


3 


195 


FVFSLVNSFL 


126.000 


SEQIDNO:36 


4 


285 


QMMTFFIFLL 


100.800 


SEQ ID NO:25 




HLA A24 : nonamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ro : 


1 


14 


lYSGGITKL 


220.000 


SEQ ID N0:41 


2 


91 


VYYLAEYNL 


200.000 


SEQ ID NO:42 


3 


165 


YYWLNTVAL 


200.000 


SEQ ID NO:43 


4 


187 


lYRLLLMDF 


120.000 


SEQ ID NO:44 




HLA A24 : decamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score 


SEQ ID : 


1 


145 


VYVLLIRNIF 


252.000 


SEQIDNO:45 


2 


164 


CYYWLNTVAL 


200.000 


SEQIDNO:46 


3 


92 


YYLAEYNLEF 


165,000 


SEQIDNO:47 




HLA A3 : nonamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parler's Score * 


SEQ ID : 


1 


258 


MLFIMFYSK 


900,000 


SEQ ID NO:48 


2 


148 


LLIRNIFLK 


135.000 


SEQ ID NO:49 


3 


221 


GLQEFDIAR 


108,000 


SEQ ID NO:50 



10 

• : Estimate of Half Time of Disassociation of a Molecule Containing This Subsequence. 



HLA A3 : decamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ID : 


1 


401 


KMFLIEKLIK 


600.000 


SEQIDN0:51 


2 


257 


IblLFIMFYSK 


270.000 


SECJID NO:52 


3 


407 


KLIKLQDMEK 


180.000 


SEQ ID NO:53 


4 


147 


VLLIRNIFLK 


135.000 


SEQ ID NO:54 



15 







HLA B7 : 


nonamers 




Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ID : 


1 


140 


MPRHEVYVL 


800.000 


SEQ ID NO:55 








HLA B7 : 


decamers 




Rank 


Start Position 


Subsequence Residue Listing 




Parker's Score * 


SEQ ID : 
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MPRHEVYVLL 



800.000 



SEQ ID NO:56 



HLA B4403 nonatners 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ID : 


1 


138 


YEMPRHEVY 


480.000 


SEQ ID NO:57 



HLA B4403 decamers 


Rank 


Start Position 


Subsequence Residue Listing 


Parker's Score * 


SEQ ID: 


1 


179 


WETLIGQDIY 


120.000 


SEQ ID NO:58 



*: Estimate of half time of disassociation of a molecule containing this subsequence. 



HLA-DRB1*1501 : nonamers 


Rank 


Start Position 


Subsequence Residue Listing 


Tepitope Score 


SEQ ID: 


1 


363 


VHFFFILTL 


6.5 


SEQ ID NO:59 


2 


210 


LRRIIGMQL 


6.2 


SEQ ID NO:60 




HLA-DRB1*0301 : nonamers 


Rank 


Start Position 


Subsequence Residue Listing 


Tepitope Score 


SEQ ID : 


1 


193 


LLMDFVFSL 


6.4 


SEQIDN0:61 




HLA-DRB1*0703 : nonamers 


Rank 


Start Position 


Subsequence Residue Listing 


Tepitope Score 


SEQ ID : 


1 


216 


MQLITSLGL 


8.5 


SEQ ID NO:62 


2 


197 


FVFSLVNSF 


8 


SEQ ID NO:63 


3 


367 


FILTLIVLI 


8 


SEQ ID NO:64 


4 


210 


LRRIIGMQL 


7.6 


SEQ ID NO:60 


5 


381 


WQITEGRKI 


7.6 


SEQ ID NO:65 


6 


148 


VLLIRNIFL 


7.3 


SEQ ED NO:49 


7 


265 


MFYSKNISL 


7.3 


SEQIDNO:66 


8 


263 


FIMFYSKNI 


7.2 


SEQ ID NO:67 


9 


305 


VLCTLAITI 


7.2 


SEQ ID NO: 17 


10 


284 


WRASQMMTF 


6.8 


SEQ ID NO:68 


11 


365 


FFFILTLIV 


6.8 


SEQ ID NO:69 


12 


133 


FRLVERYEM 


6.7 


SEQ ID NO:70 


13 


366 


FFILTLIVL 


6.4 


SEQ ID N0:71 


14 


238 


YAQTLVWI 


6.4 


SEQIDNO:72 
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SEQUENCE INFORMATION 
SEQ ED N0:1 

TGGGGAGGCAGAAGGCAGACTGATCACTTGAGGCCAGGAGTTTGAGACCTCAT 
GTCTAAAAAAAAAAAATTCTGTGAGGTGAGTTTTATTGTTATTCCCTCTCTACA 
ATATGGAAACTGAGGCTGAGAATCAGAACCATTCACAAGACAAAAATCCCCCAG 
TTGGCAGATCCAGGGTTGCAAGCCAGGCCTGTGCAGCCCCAAAACCAGTGCTTG 
TTTAACCACTGTGTGGTGACCACACCGCTCCAGGCCAACAGCTTGGGGCTAAGT 
CTTCACGTTGCCTTTCACCATTAAATAATAGGGCTGCCCTTTGTTGAAGCCCTGC 
ACTCCCAGTGACGGCCATAATAACCTTCAGGTGTTCTGCTTTCTGCCTTCTCTAG 
Catggccaagtatttccggaacaacttcattaatccccacatttactccggagggatcaccaagctgatcttttgctgggacttcactg 
tcactcatgaaaaagctgtgaagctaaaacagaagaatcttagcactgagataagggagaacctgtcagagctccgtcaggagaa 
ttccaagttgacgttcaatcagctgctgacccgcttctctgcctacatggtagcctgggttgtctctacaggagtggccatagcctgctg 
tgcagccgtttattacctggctgagtacaacttagagttcctgaagacacacagtaaccctggggcggtgctgttactgcctttcgttgt 
gtcctgcattaatctggccgtgccatgcatctactccatgttcaggcttgtggagaggtacgagatgccacggcacgaagtctacgttc 
tcctgatccgaaacatctttttgaaaatatcaatcattggcattctttgttactattggctcaacaccgtggccctgtctggtgaagagtg 
ttgggaaaccctcattggccaggacatctaccggctccttctgatggatmgtgttctcmagtcaattccttcctgggggagtttctga 
ggagaatcattgggatgcaactgatcacaagtcttggccttcaggagtttgacattgccaggaacgttctagaactgatctatgcaca 
aactctggtgtggattggcatcttcttctgccccctgctgccctttatccaaatgattatgcttttcatcatgttctactccaaaaatatca 
gcctgatgatgaatttccagcctccgagcaaagcctggcgggcctcacagatgatgattttcttcatcttcttgctctttttcccatcctt 
caccggggtcttgtgcaccctggccatcaccatctggagattgaagccttcagctgactgtggcccttttcgaggtctgcctctcttcat 
tcactccatctacagctggatcgacaccctaagtacacggcctggctacctgtgggttgtttggatctatcggaacctcattggaagtg 
tgcacttctttttcatcctcaccctcattgtgctaatcatcacctatctttactggcagatcacagagggaaggaagattatgataaggc 
tgctccatgagcagatcattaatgagggcaaagataaaatgttcctgatagaaaaattgatcaagctgcaggatatggagaagaaa 
gcaaaccccagctcacttgttctggaaaggagagaggtggagcaacaaggctttttgcatttgggggaacatgatggcagtcttgac 
ttgcgatctagaagatcagttcaagaaggtaatccaagggcctgaTGACTCTTTTGGTAACCAGACACCAA 
TCAAATAAGGGGAGGAGACGAAAATGGAATGATTTCTTCCATGCCACCTGTGCC 
TTTAGGAACTGCCCAGAAGAAAATCCAAGGCTTTAGCCAGGAGCGGAAACTGAC 
TACCATGTAATTATCAAAGTAAAATTGGGCATTCCATGCTATTTTTAATACCTGG 
ATTGCTGATTTTTCAAGACAAAATACTTGGGGTTTTCCAATAAAGA^ 
TATTGAAATGAGCCTACAAAAACCTAGGAAGAGATAACTAGGGAATAATGTATA 
TTATCnrCAAGAAATGTGTGCAGGAATGATTGGTTCTTAGAAATCTCTCCTGCCA 
GACTTCCCAGACCTGGCAAAGGTTTAGAAACTGTtGCTAAGAAAAGTGGTCCAT 
CCTGAATAAACATGTAATACTCCAGCAGGGATATGAAGCCTCTGAATTGTAGAA 
CCTGCATTTATTTGTGA(nTTGAACrAAAGACATCCCCCATGTCCCAAAGG 
AATACAACCAGAGGTCTCATCTCTGAACTTTCTTGCGTACTGATTACATGAGTCT 
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TTGGAGTCGGGGATGGAGGAGGTTCTGCCCCTGTGAGGTGTTATACATGACCAT 
CAAAGTCCTACGTCAAGCT 



5 SEQroNO:2 

MAKYFRNNFINPHIYSGGITKLIFCWDFTVTHEKAVKLKQKNLSTEIRENLSELRQENSKLTFNQLLTRFSA 
YMVAWWSTGVAIACCAAVYYLAEYNLEFLKTHSNPGAVLLLPFWSCINLAVPCIYSMFRLVERYEMPRHE 
VYVLLIRNIFLKISIIGILCYYWLNTVALSGEECWETLIGQDIYRLLLMDFVFSLVNSFLGEFLRRIIGMQL 
10 ITSLGLQEFDIARWLELIYAQTLVWIGIFFCPLLPFIQMIMLFIMFYSBCNISLMMNFQPPSKAWRASQMMT 
FFIFLLFFPSFTGVLCTLAITIWRLKPSADCGPFRGLPLFIHSIYSWIDTLSTRPGYLWWWIYRNLIGSVH 
FFFILTLIVLIITYLYWQITEGRKIMIRLLHEQIINEGKDKMFLIEKLIKLQDMEKKANPSSLVLERREVEQ 
QGFLHLGEHDGSLDLRSRRSVQEGNPRA 

15 SEQ ID NO:3 

TGGGGAGGCAGAAGGCAGACTGATCACTTGAGGCCAGGAGTTTGAGACCTCAT 

GTCTAAAAAAAAAAAATTCTGTGAGGTGAGTTTTATTGTTATTCCCTCTCT 

ATATGGAAACTGAGGCTGAGAATCAGAACCATTCACAAGACAAAAATCCCCCAG 

20 TTGGCAGATCCAGGGTTGCAAGCCAGGCCTGTGCAGCCCCAAAACCAGTGCTTG 
TTTAACCACTGTGTGGTGACCACACCGCTCCAGGCCAACAGCTTGGGGCTAAGT 
CTTCACGTTGCCTTTCACCATTAAATAATAGGGCTGCCCTTTGT^ 
ACTCCCAGTGACGGCCATAATAACCTTCAGGTGTTCTGCTTTCTGCCTTCTCTAG 
Catggccaagtatttccggaacaacttcattaatccccacatttactccggagggatcaccaagctgatcttttgctgggacttcactg 

25 tcactcatgaaaaagctgtgaagctaaaacagaagaatcttagcactgagataagggagaacctgtcagagctccgtcaggagaa 
nccaagttgacgttcaatcagctgctgacccgcttctctgcctacatggtagcctgggttgtctctacaggagtggccatagcctgctg 
tgcagccgtttattacctggctgagtacaacttagagttcctgaagacacacagtaaccctggggcggtgctgttactgcctttcgttgt 
gtcctgcattctggccgtgccatgcatctactccatgttcaggcttgtggagaggtacgagatgccacggcacgaagtctacgftctcc 
tgatccgcaggggattgatgtagTTCTCAAGTATGGGATGTACAGATGGGCAGGCAGTGCAC 

30 GCACAAAGGCTCCTGGGCTGAGGACGGGACTGAAATCATCCAGCGTTCCCCTTA 
GTCAAGCTAAACATCTTTTTGAAAATATCAATCATTGGCATTCTTTG 
GGCTCAACACCGTGGCCCTGTCTGGTGAAGAGTGTTGGGAAACCCTCATTGGCC 
AGGACATCTACCGGGTCCTTCTGATGGATTTTGTGTTCTCTTTAGTCAATTC 
CCTGGGGGAGTTTCTGAGGAGAATCATTGGGATGCAACTGATCACAAGTCTTGG 

35 CCrrCAGGAGTTTGACATTGCCAGGAACGTTCTAGAACTGATCTATGCACAAAC 
TCTGGTGTGGATTGGCATCnTCnTCTGCCCCCTGCTGCCCTTTATCCA^ 
ATGCTTTTCATCATGTTCTACTCCAAAAATATCAGCCTGATGATGAAT^ 
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CTCCGAGCAAAGCCTGGCGGGCCrCACAGATGATGACTTTCTTCATCTTCT^ 
TCITTTTCCCATCCTTCACCGGGGTCTTGTGCACCCTGGCCATCACCATCTGGA 
GATTGAAGCCTTCAGCTGACTGTGGCCCTTTTCGAGGTCTGCCTCTCTTCATTC 
ACTCCATCTACAGCTGGATCGACACCCTAAGTACACGGCCTGGCTACCTGTGGG 
5 TTGTTTGGATCTATCGGAACCTCATTGGAAGTGTGCACTTCTTTTTCATCCT 
CCTCATTGTGCTAATCATCACCTATCTTTACTGGCAGATCACAGAGGGAAGGAA 
GATTATGATAAGGCTGCTCCATGAGCAGATCATTAATGAGGGCAAAGATAAAAT 
GTTCCTGATAGAAAAATTGATCAAGCTGCAGGATATGGAGAAGAAAGCAAACCC 
CAGCTCACTTGTTCTGGAAAGGAGAGAGGTGGAGCAACAAGGCITTTTGCAT^ 

10 GGGGGAACATGATGGCAGTCTTGACTTGCGATCTAGAAGATCAGTTCAAGAAG 
GTAATCCAAGGGCCTGATGACTCTTTTGGTAACCAGACACCAATCAAATAAGGG 
GAGGAGACGAAAATGGAATGATTTCITCCATGCCACCTGTGCCTTTAGGAACTG 
CCCAGAAGAAAATCCAAGGCTTTAGCCAGGAGCGGAAACTGACTACCATGTAAT 
TATCAAAGTAAAATTGGGCATTCCATGCTATTTTTAATACCTGGATTGCTGAm 

15 TTCAAGACAAAATACTTGGGGTTTTCCAATAAAGATTGTTGTAATATTGA^ 
GCCTACAAAAACCTAGGAAGAGATAACTAGGGAATAATGTATATTATCTTCAAG 
AAATGTGTGCAGGAATGATTGGTTCITAGAAATCTCTCCTGCCAGACITCCCAG 
ACCTGGCAAAGGirrAGAAACTGTTGCTAAGAAAAGTGGTCCATCCTGAATAAA 
CATGTAATACTCCAGCAGGGATATGAAGCCTCTGAATTGTAGAACCTGCATTTA 

20 TTTGTGACTTTGAACTAAAGACATCCCCCATGTCCCAAAGGTGGAATACAACCA 
GAGGTCTCATCTCTGAACTTTCTTGCGTACTGATTACATGAGTCTTTGGAGTCG 
GGGATGGAGGAGGTTCTGCCCCTGTGAGGTGTTATACATGACCATCAAAGTCCT 
ACGTCAAGCT 
SEQroN0:4 

25 

MAKYFRNimNPHIYSGGITiaiFCWDFTVT^ 

NSKLTFNQLLTRFSAYMVAWWSTGVAUCCAAVYYLAEYNLEFXKTHSW 
LPFWSCILAVPaYSlVIFRLVERYEMPRHEVYVLLn^ 

30 SEQ ID N0:5 

atcttttgctgggacttcactgtcactcatgaaaaagctgtgaagctaaaacagaagaatcttagcactgag 
ataagggagaacctgtcagagctccgtcaggagaattccaagttgacgttcaatcagctgctgacccgcttc 
tctgcctacatggtagcctgggttgtctctacaggagtggccatagcctgctgtgcagccgtttattacctg 
35 gctga'gtacaacttagagttcctgaagacacacagtaaccctggggcggtgctgttactgcctttcgttgtg 
tcctgcattaatctggccgtgccatgcatctactccatgttcaggcttgtggagaggtacgagatgccacgg 
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cacgaagtctacgttctcctgatccgaaacatctttttgaaaatatcaatcattggcattctttgttactat 
tggctcaacaccgtggccctgtctggtgaagagtgttgggaaaccctcattggccaggacatctaccggctc 
cttctgatggattttgtgttctctttagtcaattccttcctgggggagtttctgaggagaatcattgggatg 
caactgatcacaagtcttggccttcaggagtttgacattgccaggaacgttctagaactgatctatgcacaa 
5 actctggtgtggattggcatcttcttctgccccctgctgccctttatccaaatgattatgcttttcatcatg 
ttctactccaaaaatatcagcctgatgatgaatttccagcctccgagcaaagcctggcgggcctcacagatg 
atgactttct teat cttcttgctctttttcccatctttcaccggggtcttgtgcaccctggccat caeca tc 
tggagattgaagccttcagctgactgtggcccttttcgaggtctgcctctcttcattcactccatctacagc 
tggatcgacaccctaagtacacggcctggctacctgtgggttgtttggatctatcggaacctcattggaagt 

10 gtgcacttctttttcatcctcaccctcattgtgctgatcatcacctatctttactggcagatcacagaggga 
aggaagattatgataaggctgctccatgagcagatcattaatgagggcaaagataaaatgttcctgatagaa 
aaattgatcaagctgcaggatatggagaagaaagcaaaccccagctcacttgttctggaaaggagagaggtg 
gagcaacaaggctttttgcatttgggggaacatgatggcagtcttgacttgcgatctagaagatcagttcaa 
gaaggtaatccaagggcctgaTGACTCTTTTGGTAACCAGACACCAATCAAATAAGGGGAGGAGATGAAAAT 

15 GGAATGATTTCTTCCATGCCACCTGTGCCTTTAGGAACTGCCCAGAAGAAAATCCAAGGCTTTAGCCAGGAG 
CGGAAACTGACTACCATGTAATTATCAAAGTAAAATTGGGCATTCCATGCTATTTTTAATACCTGGATTGCT 
GATTTTTCAAGACAAAATACTTGGGGTTTTCCAATAAAGATTGTTGTAATATTGAAATGAGCCTACAAAAAC 
CTAGGAAGAGATAACTAGGGAATAATGTATATTATCTTCAAGAAATGTGTGCAGGAATGATTGGTTCTTAGA 
AATCTCTCCTGCCAGACTTCCCAGACCTGGCAAAGGTTTAGAAACTGTTGCTAAGAAAAGTGGTCCATCCTG 

20 AATAAACATGTAATACTCCAGCAGGGATATGAAGCCTCTGAATTGTAGAACCTGCATTTATTTGTGACTTTG 
AACTAAAGACATCCCCCATGTCCCAAAGGTGGAATACAACCAGAGGTCTCATCTCTGAACTTTCTTGCGTAC 
TGATTACATGAGTCTTTGGAGTCGGGGATGGAGGAGGTTCTGCCCCTGTGAGGTGTTATACATGACCATCAA 
AGTCCTACGTCAAGCT 

25 SEQIDNO:6 

IFCWDFTVTHEKAVKLKQKNLSTEIRENLSELRQENSKLTFNQLLTRFSAYMVAWWSTGVAIACCAAVYYL 
AEYNLEFLKTHSNPGAVLLLPFVVSCINLAVPCIYSMFRLVERYEMPRHEVYVLLIRNIFLKISIIGILCYY 
WLNTVALSGEECWETLIGQDIYRLLLMDFVFSLVNSFLGEFLRRIIGMQLITSLGLQEFDIARNVLELIYAQ 
30 TLVWIGIFFCPLLPFIQMIMLFIMFYSKNISLMMNFQPPSECAWRASQMMTFFIFLLFFPSFTGVLCTLAITI 
WRLKPSADCGPFRGLPLFIHSIYSWIDTLSTRPGYLWWWIYRNLIGSVHFFFILTLIVLIITYLYWQITEG 
RKIMIRLLHEQIINEGKDKMFLIEKLIKLQDMEKKANPSSLVLERREVEQQGFLHLGEHDGSLDLRSRRSVQ 
EGNPRA 

35 SEQIDNO:? 

ctgatgatgaatttccagcctccgagcaaagcctggcgggcctcacagatgatgactttcttcatcttcttgctctttttcccatctttcac 

cggggtcttgtgcaccctggccatcaccatctggagattgaagccttcagctgactgtggcccttttcgaggtctgcctctcttcattca 

ctccatctacagctggatcgacaccctaagtacacggcctggctacctgtgggttgtttggatctatcggaacctcattggaagtgtgc 
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acttctttttcatcctcaccctcattgtgctgatcatcacctatctttactggcagatcacagagggaaggaagattatgataaggctgc 
tccatgagcagatcattaatgagggcaaagataaaatgttcctgatagaaaaattgatcaagctgcaggatatggagaagaaagc 
aaaccccagctcacttgttctggaaaggagagaggtggagcaacaaggctttttgcatttgggggaacatgatggcagtcttgactt 
gcgatctagaagatcagttcaagaaggtaatccaagggcctgaTGACTCTTTTGGTAACCAGACACCAAT 
5 CAAATAAGGGGAGGAGATGAAAATGGAATGATTTCTTGCATGCCACCTGTGCCT 
TTAGGAACTGCCCAGAAGAAAATCCAAGGCTTTAGCCAGGAGCGGAAACTGACT 
ACCATGTAATTATCAAAGTAAAATTGGGCATTCCATGCTATTTTTAATACCTGGA 
TTGCTGATTTTTCAAGACAAAATACTTGGGGTTTTCCAATAAA^^ 
ATTGAAATGAGCCTACAAAAACCTAGGAAGAGATAACTAGGGAATAATGTATAT 

10 TATCTTCAAGAAATGTGTGCAGGAATGATTGGTTCTTAGAAATCTCTCCTGCCA 
GACTTCCCAGACCTGGCAAAGGTTTAGAAACTGTTGCTAAGAAAAGTGGTCCAT 
CCTGAATAAACATGTAATACTCCAGCAGGGATATGAAGCCTCTGAATTGTAGAA 
CCTGCATTTATTTGTGACTTTGAACTAAAGACATCCCCCATGTCCCAAAGGTGG 
AATACAACCAGAGGTCTCATCTCTGAACTTTCTTGCGTACTGATTACATGAGTCT 

15 TTGGAGTCGGGGATGGAGGAGGTTCTGCCCCTGTGAGGTGTTATACATGACCAT 
CAAAGTCCTACGTCAAGCT 

SEQ ID NO:8 

20 LMMNFQPPSKAWRASQMMTFFIFLLFFPSFTGVLCTLAITIWRLKPSADCGPFRGLPLFIHSIYSWIDTLST 
RPGYLWWWIYRNLIGSVHFFFILTLIVLIITYLYWQITEGRKIMIRLLHEQIINEGKDKMFLIEKLIKLQD 
MEKKANPSSLVLERREVEQQGFLHLGEHDGSLDLRSRRSVQEGNPRA 

SEQIDN0:9 

25 

ITEGRKIMIR 
SEQroNO:10 
30 LLMDFVFSL 
SEQroNOtll 

FLLFFPSFT 

35 

SEQIDN0:12 

QMMTFFIFL 
40 SEQIDN0:13 
MMTFFIFLL 
SEQroN0:14 

45 
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FLIEKLIKL 

SEQroNO:15 

5 VLLIRNIFL 
SEQ ID NO:16 

LVWIGIFFC 

10 

SEQIDNO:17 

TLAITIWRL 
15 SEQroN0:18 
LIFCWDFTV 
SEQ ID NO:19 

20 

FLGEFLRRI 
SEQIDNO:20 
25 LLLPFWSC 
SEQIDNO:21 

LLTRFSAYM 

30 

SEQIDNO:22 

KLIFCWDFT 

35 SEQIDNO:23 
LLLMDFVFSL 
SEQIDNO:24 

40 

LLMDFVFSLV 
SEQ ID NO:25 
45 QMMTFFIFLL 
SEQn)NO:26 

YLAEYNLEFL 

50 

SEQIDNO:27 

KLIFCWDFTV 
55 SEQIDNO:28 

LLFFPSFTGV 
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SEQ ID NO:29 

QLLTRFSAYM 
5 SEQroNO:30 

TLVWIGIFFC 

SEQroN0:31 

10 

SQMMTFFIFL 
SEQIDNO:32 
15 VLLLPFWSC 
SEQroNO:33 

ILCYYWLNTV 

20 

SEQIDNO:34 

TLIVLIITYL 
25 SEQroNO:35 
YVLLIRNIFL 
SEQroNO:36 

30 

FVFSLVNSFL 
SEQIDNO:37 
35 LLLPFWSCI 
SEQroNO:38 

LIGSVHFFFX 

40 

SEQroNO:39 

YEMPRHEVYV 
45 SEQIDNO:40 
LLTRFSAYMV 
SEQroN0:41 

50 

lYSGGITKL 
SEQIDNO:42 
55 VYYLAEYNL 
SEQroNO:43 



wo 01/23417 



PCT/EPOO/09500 



10 



YYWLNTVAL 
SEQ ID NO:44 
lYRLLLMDF 
SEQ ID NO:45 
VYVLLIRNIF 
SEQ ID NO:46 

CYYWLNTVAL 
15 SEQIDNO:47 
YYLAEYNLEF 
SEQIDNO:48 

20 

MLFIMFYSK 
SEQIDNO:49 
25 LLIRNIFLK 
SEQIDNO:50 

GLQEFDIAR 

30 

SEQIDN0:51 

KMFLIEKLIK 
35 SEQ ID NO:52 
IMLFIMFYSK 
SEQIDNO:53 

40 

KLIKLQDMEK 
SEQ ID NO:54 
45 VLLIRNIFLK 
SEQ ID NO:55 

MPRHEVYVL 

50 

SEQ ID NO:56 

MPRHEVYVLL 
55 SEQIDNO:57 

YEMPRHEVY 
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SEQ ID NO:58 

WETLIGQDIY 
5 SEQ ID NO:59 
VHFFFILTL 
SEQIDNO:60 

10 

LRRIIGMQL 
SEQ ID NO:61 
15 LLMDFVFSL 
SEQ ID NO:62 

MQLITSLGL 

20 

SEQ ID NO:63 

FVFSLVNSF 
25 SEQ ID NO:64 
FILTLIVLI 

SEQ ID NO:65 

30 

WQITEGRKI 
SEQ ID NO:66 
35 MFYSKNISL 
SEQIDNO:67 

FIMFYSECNI 

40 

SEQIDNO:68 

WRASQMMTF 
45 SEQIDNO:69 
FFFILTLIV 
SEQIDNO:70 

50 

FRLVERYEM 
SEQIDN0:71 
55 FFILTLIVL 
SEQ ID NO:72 



71 



wo 01/23417 PCT/EPOO/09500 



YAQTLVWI 



1 
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Claims 

1. An isolated polypq)tide comprising an amino acid sequence which has at leas^ 
identity to the amino acid sequence of SEQ ED N0:2 or SEQ ID N0:4 over the entire 

5 length of of SEQ ID N0:2 or SEQ ID N0:4 respectively. 

2. An isolated polypeptide as claimed in claim 1 in which the amino acid sequence has at 
least 95% identity to SEQ ED N0:2 or SEQ ID N0:4. 

10 3. The polypeptide as claimed in claim 1 comprising the amino acid sequence of SEQ ID 
N0:2 or SEQ ID N0:4. 

4. The isolated polypeptide of SEQ ED N0:2 or SEQ ID N0:4. 

15 5. A polypeptide comprising an immunogenic fragment of a polypeptide as claimed in 
any one of claims 1 to 4 in which the immxmogenic activity of the immunogenic fragment 
is substantially the same as the polypeptide of SEQ ID N0:2 or SEQ ID N0:4. 

6. A polypeptide as claimed in any of claims 1 to 5 wherein said polypeptide is part of a 
20 larger fusion protein. 

7. A polypeptide as claimed in any of claims 1 to 6 chemically conjugated to a carrier 
protein. 

25 8. An isolated polynucleotide encoding a polypeptide as claimed in any of claims 1 to 6. 

9. An isolated polynucleotide comprising a nucleotide sequence encoding a polypeptide that 
has at least 70% identity to the amino acid sequence of SEQ ID N0:2 or SEQ ID N0:4, 
over the entire length of SEQ ED N0:2 or SEQ ID N0:4 respectively; or a nucleotide 

30 sequence complementary to said isolated polynucleotide. 

10. An isolated polynucleotide comprising a nucleotide sequence that has at least 70% 
identity to a nucleotide sequence encoding a polypeptide of SEQ ED N0:2 or SEQ ID N0:4, 
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over the entire coding region; or a nucleotide sequence complementary to said isolated 
polynucleotide. 

11. An isolated polynucleotide which comprises a nucleotide sequence which has at least 
5 70% identity to that of SEQ ID NO: 1 or SEQ ID N0:3 over the entire length of SEQ ID 

N0:1 or SEQ ID N0:3 respectively; or a nucleotide sequence complementary to said 
isolated polynucleotide. 

12. The isolated polynucleotide as defined in any one of claims 8 to 1 1 in which the 
10 identity is at least 95%. 

13. An isolated polynucleotide selected from: 

(a) a polynucleotide comprising a nucleotide sequence encoding the polypeptide of SEQ 
IDN0:2 orSEQIDNO:4; 

15 (b) the coding region of the polynucleotide of SEQ ID NO: 1 or SEQ ID N0:3; and 
(c) a polynucleotide obtainable by screening an appropriate library under stringent 
hybridization conditions with a labeled probe having the sequence of SEQ ID N0:1 or SEQ 
ID N0:3 or a fragment thereof said polynucleotide encoding a protein which has similar 
immunogenic properties to those of the protein of sequence ID N0:2 or SEQ ID N0:4 

20 respectively or a nucleotide sequence complementary to said isolated polynucleotide 

14. An expression vector or a recombinant Uve microorganism comprising an isolated 
polynucleotide according to any one of claims 8-13. 

25 15. A host cell comprising the expression vector of claim 14 or the isolated 
polynucleotide of claims 8 to 13. 

16. A process for producing a polypeptide of claims 1 to 7 comprising culturing a host 
cell of claim 15 under conditions sufficient for the production of said polypeptide and 

30 recovering the polypeptide from the culture medium. 

17. A vaccine comprising an effective amount of the polypeptide of any one of claims 1 
to 7 and a pharmaceutically acceptable carrier. 
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1 8. A vaccine comprising an effective amount of the polynucleotide of any one of claims 
8 to 13 and a phaimaceutically effective carrier. 

5 19. A vaccine comprising an effective amount of antigen presenting cells, modified by in 
vitro loading with a polypeptide of any one of claims 1 to 7, or genetically modified in 
vitro to express a polypeptide of claims 1 to 7 and a phaimaceutically effective carrier. 

20. A vaccine as claimed in any one of claims 17 to 19 which additionally comprises a 
10 TH-1 inducing adjuvant. 

21. A vaccine as claimed in claim 20 in which the TH-1 inducing adjuvant is selected 
from the group of adjuvants comprising: 3D-MPL, QS21, a mixture of QS21 and 
cholesterol, and a CpG oligonucleotide. 

15 

22. An antibody immunospecific for the polypeptide or immunological fragment as 
claimed in any one of claims 1 to 5. 

23. A method for screening to identify compounds which stimulate or which inhibit the 
20 function of the polypeptide of any one of claims 1 to 5 which comprises a method selected 

from the group consisting of: 

(a) measuring the binding of a candidate compound to the said polypeptide (or to the 
cells or membranes bearing the polypeptide) or a fusion protein thereof by means of a 
label directly or indirectly associated with the candidate compound; 
25 (b) measuring the binding of a candidate compound to the said polypeptide (or to the 

cells or membranes bearing the polypeptide) or a fusion protein thereof in the presence of 
a labeled competitior, 

(c) testing whether the candidate compound results in a signal generated by activation or 
inhibition of the said polypeptide, using detection systems appropriate to the cells or cell 

30 membranes bearing the polypeptide; 

(d) mixing a candidate compound with a solution containing a polypeptide of any one of 
claims 1 to 7, to form a mixture, measuring activity of the polypeptide in the mixture, and 
comparing the activity of the mixture to a standard; or 
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(e) detecting the effect of a candidate compound on the production of mRNA encoding 
said polypeptide and said polypeptide in cells, using for instance, an ELISA assay. 

24. A method for the treatment of a subject by immxmoprophylaxis or therapy 

5 comprising in vitro induction of immune responses to a molecule of any one of claims 1 
to 5, using in vitro incubation of the polypeptide of any one of claims 1 to 7 or the 
polynucleotide of any one of claims 8 to 13 with cells from the immune system of a 
mammal, and reinflising these activated immune cells to the mammal for the treatment of 
disease. 

10 

25. A method as claimed in claim 24 wherein the treatment is for ovarian or colon 
cancer. 

26. An agonist or antagonist to the polypeptide of claims 1 to 5. 

15 

27. A compound which is: 

(a) an agonist or antagonist to the polypeptide of claims 1 to 5; 

(b) isolated polynucleotide of claims 8 to 13; or 

(c) a nucleic acid molecule that modulates the expression of the nucleotide sequence 
20 encoding the polypeptide of any one of claims 1 to 5; 

for use in therapy. 

28. A process for diagnosing a disease or a susceptibihty to a disease in a subject related 
to expression or activity of a polypeptide of any one of claims 1 to 5 in a subject 

25 comprising analyzing for the presence or amount of said polypeptide in a sample derived 
from said subject. 

29. A process for diagnosing a disease or a susceptibility to a disease in a subject related 
to expression or activity of a polynucleotide of any one of claims 8 to 1 3 in a subject 

30 comprising analyzing for the presence or amount of said polynucleotide in a sample 
derived from said subject. 
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30. A process for diagnosing the presence of colon cancer or a susceptibility to colon 
cancer in a subject related to expression or activity of a polypeptide of any one of claims 1 
to 5 in a subject comprising analyzing for the presence or amount of said polypeptide in a 
sample derived from said subject. 

5 

31. A process for diagnosing the presence of colon cancer or a susceptibility to colon 
cancer in a subject related to expression or activity of a polynucleotide of any one of 
claims 8 to 13 in a subject comprising analyzing for the presence or amount of said 
polynucleotide in a sample derived from said subject. 

10 

32. An isolated polynucleotide selected from the group consisting of 

(a) an isolated polynucleotide comprising a nucleotide sequence which has at least 70% 
identity to SEQ ID N0:5 or SEQ ID N0:7 over the entire length of SEQ ID N0:5 or SEQ 
ID N0:7 respectively; 

15 (b) an isolated polynucleotide comprising the polynucleotide of SEQ ID N0:5 or SEQ ID 
N0:7; 

(c) the polynucleotide of SEQ ID N0:5 or SEQ ID N0:7. 

33. A live vaccine composition comprising an expression vector or recombinant live 
20 micro-organism according to claim 14. 

34. Use of a polynucleotide as claimed in any one of claims 8 to 13 for the manufacture 
of a medicament in the treatment of carcinoma. 

25 35. Use of a polynucleotide as claimed in any one of claims 8 to 1 3 for the manufacture 
of a medicament in the treatment of colon carcinoma. 

36. Use of a polypeptide as claimed in any one of claims 1 to 7 for the manufacture of a 
medicament in the treatment of carcinoma. 

30 

37. Use of a polypeptide as claimed in any one of claims 1 to 7 for the manufacture of a 
medicament in the treatment of colon carcinoma. 
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Figure 1: RT PGR data of CASB641 1 transcript on coloractal tumours 
and matched normal colon. 
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Legend: 

Ad_G!: adrenal gland; 
Ao: aorta; Bl: bladder, 

Bo_Ma: bone marrow; 
Bra: brain; 

Ce: cervix; Co: colon; 

Fa_Tu: fallopian tube; He: heart; 

II: ileon; Ki: kidney; Li: liver; 

Lu: lung; Ly_No: lymph node; 

Oe: oesophagus; Ov: ovary; 

Pa_Thy: parathyroid gland; 

PI: placenta; Pr: prostate; 

Re: rectum; Sk: skin; 

Sk_Mu: skeletal muscle; 

Sm_In: small intestine; 

Sp: spleen; St: stomach; Te: testis; 

Thy: thyroid gland; Tra: trachea; 

Bre: Breast. 



Figure 2 : RT PGR data of CASB7641 1 
transcript in a panel of normal tissues. 
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SEQUENCE LISTING 
<110> SmithKline Beecham Biologicals 
<120> Novel compounds 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



<130> BC45263 
<160> 72 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 2407 
<212> DNA 

<213> Artificial Sequence 



<400> 
tggggaggca 
aaaaaaaaat 
aggctgagaa 
caagccaggc 
ccgctccagg 
gggctgccct 
ctgctttctg 
atttactccg 
aaagctgtga 
ctccgtcagg 
atggtagcct 
ctggctgagt 
ctgcctttcg 
cttgtggaga 
tttttgaaaa 
tctggtgaag 
gattttgtgt 
atgcaactga 
ctgatctatg 
atccaaatga 
ttccagcctc 
ctctttttcc 
aagccttcag 
agctggatcg 
aacctcattg 
tatctttact 
atcattaatg 
atggagaaga 
ggctttttgc 
caagaaggta 
ggaggagacg 
aagaaaatcc 
aattgggcat 
tggggttttc 
gataactagg 
agaaatctct 
aagtggtcca 
tagaacctgc 
atacaaccag 
tcggggatgg 
tcaagct 



gaaggcagac 
tctgtgaggt 
tcagaaccat 
ctgtgcagcc 
ccaacagctt 
ttgttgaagc 
ccttctctag 
gagggatcac 
agctaaaaca 
agaattccaa 
gggttgtctc 
acaacttaga 
ttgtgtcctg 
ggtacgagat 
tatcaatcat 
agtgttggga 
tctctttagt 
tcacaagtct 
cacaaactct 
ttatgctttt 
cgagcaaagc 
catccttcac 
ctgactgtgg 
acaccctaag 
gaagtgtgca 
ggcagatcac 
agggcaaaga 
aagcaaaccc 
atttggggga 
atccaagggc 
aaaatggaat 
aaggctttag 
tccatgctat 
caataaagat 
gaataatgta 
cctgccagac 
tcctgaataa 
atttatttgt 
aggtctcatc 
aggaggttct 



tgatcacttg 

gagttttatt 

tcacaagaca 

ccaaaaccag 

ggggctaagt 

cctgcactcc 

catggccaag 

caagctgatc 

gaagaatctt 

gttgacgttc 

tacaggagtg 

gttcctgaag 

cattaatctg 

gccacggcac 

tggcattctt 

aaccctcatt 

caattccttc 

tggccttcag 

ggtgtggatt 

catcatgttc 

ctggcgggoc 

cggggtcttg 

cccttttcga 

tacacggcct 

cttctttttc 

agagggaagg 

taaaatgttc 

cagctcactt 

acatgatggc 

ctgatgactc 

gatttcttcc 

ccaggagcgg 

ttttaatacc 

tgttgtaata 

tattatcttc 

ttcccagacc 

acatgtaata 

gactttgaac 

tctgaacttV 

gcccctgtga 



aggccaggag 
gttattccct 
aaaatccccc 
tgcttgttta 
cttcacgttg 
cagtgacggc 
tatttccgga 
ttttgctggg 
agcactgaga 
aatcagctgc 
gccatagcct 
acacacagta 
gccgtgccat 
gaagtctacg 
tgttactatt 
ggccaggaca 
ctgggggagt 
gagtttgaca 
ggcatcttct 
tactccaaaa 
tcacagatga 
tgcaccctgg 
ggtctgcctc 
ggctacctgt 
atcctcaccc 
aagattatga 
ctgatagaaa 
gttctggaaa 
agtcttgact 
ttttggtaac 
atgccacctg 
aaactgacta 
tggattgctg 
ttgaaatgag 
aagaaatgtg 
tggcaaaggt 
ctccagcagg 
taaagacatc 
cttgcgtact 
ggtgttatac 



tttgagacct 
ctctacagat 
agttggcaga 
accactgtgt 
cctttcacca 
cataataacc 
acaacttcat 
acttcactgt 
taagggagaa 
tgacccgctt 
gctgtgcagc 
accctggggc 
gcatctactc 
ttctcctgat 
ggctcaacac 
tctaccggct 
ttctgaggag 
ttgccaggaa 
tctgccccct 
atatcagcct 
tgactttctt 
ccatcaccat 
tcttcattca 
gggttgtttg 
tcattgtgct 
taaggctgct 
aattgatcaa 
ggagagaggt 
tgcgatctag 
cagacaccaa 
tgcctttagg 
ccatgtaatt 
atttttcaag 
cctacaaaaa 
tgcaggaatg 
ttagaaactg 
gatatgaagc 
ccccatgtcc 
gattacatga 
atgaccatca 



catgtctaaa 
atggaaactg 
tccagggttg 
ggtgaccaca 
ttaaataata 
ttcaggtgtt 
taatccccac 
cactcatgaa 
cctgtcagag 
ctctgcctac 
cgtttattac 
ggtgctgtta 
catgttcagg 
ccgaaacatc 
cgtggccctg 
ccttctgatg 
aatcattggg 
cgttctagaa 
gctgcccttt 
gatgatgaat 
catcttcttg 
ctggagattg 
ctccatctac 
gatctatcgg 
aatcatcacc 
ccatgagcag 
gctgcaggat 
ggagcaacaa 
aagatcagtt 
tcaaataagg 
aactgcccag 
atcaaagtaa 
acaaaatact 
cctaggaaga 
attggttctt 
ttgctaagaa 
ctctgaattg 
caaaggtgga 
gtctttggag 
aagtcctacg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
> 960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1580 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2407 



<210> 2 
<211> 460 
<212> PRT 

<213> Artificial Sequence 



1 



wo 01/23417 



PCT/EPOO/09500 



<400> 2 

Met Ala Lys Tyr Phe Arg Asn Asn Phe lie Asn Pro His He Tyr Ser 
1 5 10 15 

5 Gly Gly He Thr Lys Leu He Phe Cys Trp Asp Phe Thr Val Thr His 
20 25 30 

Glu Lys Ala Val Lys Leu Lys Gin Lys Asn Leu Ser Thr Glu He Arg 

35 40 45 

Glu Asn Leu Ser Glu Leu Arg Gin Glu Asn Ser Lys Leu Thr Phe Asn 
10 50 55 60 

Gin Leu Leu Thr Arg Phe Ser Ala Tyr Met Val Ala Trp Val Val Ser 
65 70 75 80 

Thr Gly Val Ala He Ala Cys Cys Ala Ala Val Tyr Tyr Leu Ala Glu 
85 90 95 

15 Tyr Asn Leu Glu Phe Leu Lys Thr His Ser Asn Pro Gly Ala Val Leu 
100 105 110 

Leu Leu Pro Phe Val Val Ser Cys He Asn Leu Ala Val Pro Cys He 

115 120 125 

Tyr Ser Met Phe Arg Leu Val Glu Arg Tyr Glu Met Pro Arg His Glu 
20 130 135 140 

Val Tyr Val Leu Leu He Arg Asn He Phe Leu Lys He Ser He He 
145 150 155 . 160 

Gly He Leu Cys Tyr Tyr Trp Leu Asn Thr Val Ala Leu Ser Gly Glu 
165 170 175 

25 Glu Cys Trp Glu Thr Leu He Gly Gin Asp He Tyr Arg Leu Leu Leu 
180 185 190 

Met Asp Phe Val Phe Ser Leu Val Asn Ser Phe Leu Gly Glu Phe Leu 

195 200 205 

Arg Arg He He Gly Met Gin Leu He Thr Ser Leu Gly Leu Gin Glu 
30 210 215 220 

Phe Asp He Ala Arg Asn Val Leu Glu Leu He Tyr Ala Gin Thr Leu 
225 230 235 240 

Val Trp He Gly He Phe Phe Cys Pro Leu Leu Pro Phe He Gin Met 
245 250 . 255 

35 He Met Leu Phe He Met Phe Tyr Ser Lys Asn He Ser Leu Met Met 
260 265 270 

Asn Phe Gin Pro Pro Ser Lys Ala Trp Arg Ala Ser Gin Met Met Thr 

275 280 285 

Phe Phe He Phe Leu Leu Phe Phe Pro Ser Phe Thr Gly Val Leu Cys 
40 290 295 300 

Thr Leu Ala He Thr He Trp Arg Leu Lys Pro Ser Ala Asp Cys Gly 
305 310 315 320 

Pro Phe Arg Gly Leu Pro Leu Phe He His Ser He Tyr Ser Trp He 
325 330 335 

45 Asp Thr Leu Ser Thr Arg Pro Gly Tyr Leu Trp Val Val Trp He Tyr 
340 345 350 

Arg Asn Leu He Gly Ser Val His Phe Phe Phe He Leu Thr Leu He 

355 360 365 

Val Leu He He Thr Tyr Leu Tyr Trp Gin He Thr Glu Gly Arg Lys 
50 370 375 380 

He Met He Arg Leu Leu His Glu Gin He He Asn Glu Gly Lys Asp 
385 390 395 400 

Lys Met Phe Leu He Glu Lys Leu He Lys Leu Gin Asp Met Glu Lys 
405 410 415 

55 Lys Ala Asn Pro Ser Ser Leu Val Leu Glu Arg Arg Glu Val Glu Gin 
420 425 430 

Gin Gly Phe Leu His Leu Gly Glu His Asp Gly Ser Leu Asp Leu Arg 

435 440 445 

Ser Arg Arg Ser Val Gin Glu Gly Asn Pro Arg Ala 
60 450 455 460 

<210> 3 
<211> 2521 
<212> DNA 

65 <213> Artificial Sequence 



2 



wo 01/23417 PCT/EPOO/09500 



<400> 3 

tggggaggca gaaggcagac tgatcacttg aggccaggag tttgagacct catgtctaaa 60 

aaaaaaaaat tctgtgaggt gagttttatt gttattccct ctctacagat atggaaactg 120 

aggctgagaa tcagaaccat tcacaagaca aaaatccccc agttggcaga tccagggttg 180 

5 caagccaggc ctgtgcagcc ccaaaaccag tgcttgttta accactgtgt ggtgaccaca 240 

ccgctccagg ccaacagctt ggggctaagt cttcacgttg cctttcacca ttaaataata 300 

gggctgccct ttgttgaagc cctgcactcc cagtgacggc cataataacc ttcaggtgtt 360 

ctgctttctg ccttctctag catggccaag tatttccgga acaacttcat taatccccac 420 

atttactccg gagggatcac caagctgatc ttttgctggg acttcactgt cactcatgaa 480 

10 aaagctgtga agctaaaaca gaagaatctt agcactgaga taagggagaa cctgtcagag 540 

ctccgtcagg agaattccaa gttgacgttc aatcagctgc tgacccgctt ctctgcctac 600 

atggcagcct gggttgtctc tacaggagtg gccatagcct gctgtgcagc cgtttattac 660 

ctggctgagt acaacttaga gttcctgaag acacacagta accctggggc ggtgctgtta 720 

ctgcctttcg ttgtgtcctg cattctggcc gtgccatgca tctactccac gttcaggctt 780 

15 gtggagaggt acgagatgcc acggcacgaa gtctacgttc tcctgatccg caggggattg 840 

atgtagttct caagtatggg atgtacagat gggcaggcag tgcacgcaca aaggctcctg 900 

ggctgaggac gggactgaaa tcatccagcg ttccccttag tcaagctaaa catctttttg 960 

aaaatatcaa tcattggcat tctttgttac tattggctca acaccgtggc cctgtctggt 1020 

gaagagtgtt gggaaaccct cattggccag gacatctacc ggctccttct gatggatttt 1080 

20 gtgttctctt tagtcaattc cttcctgggg gagtttctga ggagaatcat tgggatgcaa 1140 

ctgatcacaa gtcttggcct tcaggagttt gacattgcca ggaacgttct agaactgatc 1200 

tatgcacaaa ctctggtgtg gattggcatc ttcttctgcc ccctgctgcc ctttatccaa 1260 

atgattatgc ttttcatcat gttctactcc aaaaatatca gcctgatgat gaatttccag 1320 

cctccgagca aagcctggcg ggcctcacag atgatgactt tcttcatctt cttgctcttt 1380 

25 ttcccatcct tcaccggggt cttgtgcacc ctggccatca ccatctggag attgaagcct 1440 

tcagctgact gtggcccttt tcgaggtctg cctctcttca ttcactccat ctacagctgg 1500 

atcgacaccc taagtacacg gcctggctac ctgtgggttg tttggatcta tcggaacctc 1560 

attggaagtg tgcacttctt tttcatcctc accctcattg tgctaatcat cacctatctt 1620 

tactggcaga tcacagaggg aaggaagatt atgataaggc tgctccatga gcagatcatt 1680 

30 aatgagggca aagataaaat gttcctgata gaaaaattga tcaagctgca ggatatggag 1740 

aagaaagcaa accccagctc acttgttctg gaaaggagag aggtggagca acaaggcttt 1800 

ttgcatttgg gggaacatga tggcagtctt gacttgcgat ctagaagatc agttcaagaa 1860 

ggtaatccaa gggcctgatg actcttttgg taaccagaca ccaatcaaat aaggggagga 1920 

gacgaaaatg gaatgatttc ttccatgcca cctgtgcctt taggaactgc ccagaagaaa 1980 

35 atccaaggct ttagccagga gcggaaactg actaccatgt aattatcaaa gtaaaattgg 2040 

gcattccatg ctatttttaa tacctggatt gctgattttt caagacaaaa tacttggggt 2100 

tttccaataa agattgttgt aatattgaaa tgagcctaca aaaacctagg aagagataac 2160 

tagggaataa tgtatattat cttcaagaaa tgtgtgcagg aatgattggt tcttagaaat 2220 

ctctcctgcc agacttccca gacctggcaa aggtttagaa actgttgcta agaaaagtgg 2280 

40 tccatcctga ataaacatgt aatactccag cagggatatg aagcctctga attgtagaac 2340 

ctgcatttat ttgtgacttt gaactaaaga catcccccat' gtcccaaagg tggaatacaa 2400 

ccagaggtct catctctgaa ctttcttgcg tactgattac atgagtcttt ggagtcgggg 24 60 

atggaggagg ttctgcccct gtgaggtgtt atacatgacc atcaaagtcc tacgtcaagc 2520 

t 2521 



45 



50 



55 



60 



65 



<210> 4 
<211> 154 
<212> PRT 

<213> Artificial Sequence 
<400> 4 



Met 


Ala 


Lys 


Tyr 


Phe 


Arg 


Asn 


Asn 


Phe 


He 


Asn 


Pro 


His 


He 


Tyr 


Ser 


1 






5 










10 










15 




Gly 


Gly 


He 


Thr 


Lys 


Leu 


He 


Phe 


Cys 


Trp 


Asp 


Phe 


Thr 


Val 


Thr 


His 






20 










23 










30 






Giu 


Lys 


Ala 


Val 


Lys 


Leu 


Lys 


Gin 


Lys 


Asn 


Leu 


Ser 


Thr 


Giu 


He 


Arg 




35 










40 










45 








Giu 


Asn 
50 


Leu 


Ser 


Giu 


Leu 


Arg 
55 


Gin 


Giu 


Asn 


Ser 


Lys 
60 


Leu 


Thr 


Phe 


Asn 


Gin 


Leu 


Leu 


Thr 


Arg 


Phe 


Ser 


Ala 


Tyr 


Met 


Val 


Ala 


Trp 


Val 


Val 


Ser 


65 










70 










75 










80 


Thr 


Gly 


Val 


Ala 


lie 


Ala 


Cys 


Cys 


Ala 


Ala 


Val 


Tyr 


Tyr 


Leu 


Ala 


Giu 








85 










90 










95 




Tyr 


Asn 


Leu 


Giu 


Phe 


Leu 


Lys 


Thr 


His 


Ser 


Asn 


Pro 


Gly 


Ala 


Val 


Leu 






100 










105 










110 






Leu 


Leu 


Pro 


Phe 


Val 


Val 


Ser 


Cys 


He 


Leu 


Ala 


Val 


Pro 


Cys 


He 


Tyr 



3 



wo 01/23417 
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113 120 125 

Ser Met Phe Arg Leu Val Glu Arg Tyr Glu Met Pro Arg His Glu Val 

130 135 140 

Tyr Val Leu Leu He Arg Arg Gly Leu Met 
145 150 



<210> 5 
<211> 1960 
<212> DNA 

10 <213> Artificial Sequence 



<400> 5 

atcttttgct gggacttcac tgtcactcat 
cttagcactg agataaggga gaacctgtca 

15 ttcaatcagc tgctgacccg cttctctgcc 
grggccatag cctgctgtgc agccgtttat 
aagacacaca gtaaccctgg ggcggtgctg 
ctggccgtgc catgcatcta ctccatgttc 
cacgaagtct acgttctcct gatccgaaac 

20 ctttgttact attggctcaa caccgtggcc 
attggccagg acatctaccg gctccttctg 
ttcctggggg agtttctgag gagaatcatt 
caggagtttg acattgccag gaacgttcta 
attggcatct tcttctgccc cctgctgccc 

25 ttctactcca aaaatatcag cctgatgatg 
gcctcacaga tgatgacttt cttcatcttc 
ttgtgcaccc tggccatcac catctggaga 
cgaggtctgc ctctcttcat tcactccatc 
cctggctacc tgtgggttgt ttggatctat 

30 ttcatcctca ccctcattgt gctgatcatc 
aggaagatta tgataaggct gctccatgag 
ttcctgatag aaaaattgat caagctgcag 
cttgttctgg aaaggagaga ggtggagcaa 
ggcagtcttg acttgcgatc tagaagatca 

35 ctcttttggt aaccagacac caatcaaata 
tccatgccac ctgtgccttt aggaactgcc 
cggaaactga ctaccatgta attatcaaag 
acctggattg ctgatttttc aagacaaaat 
atattgaaat gagcctacaa aaacctagga 

40 ttcaagaaat gtgtgcagga atgattggtt 
acctggcaaa ggtttagaaa ctgttgctaa 
atactccagc agggatatga agcctctgaa 
aactaaagac atcccccatg tcccaaaggt 
tttcttgcgt actgattaca tgagtctttg 

45 tgaggtgtta tacatgacca tcaaagtcct 

<210> 6 
<211> 438 
<212> PRT 

50 <213> artifical sequence 



gaaaaagctg tgaagctaaa acagaagaat 60 

gagctccgtc aggagaattc caagttgacg 120 

tacatggtag cctgggttgt ctctacagga 180 

tacctggctg agtacaactt agagttcctg 240 

ttactgcctt tcgttgtgtc ctgcattaat 300 

aggcttgtgg agaggtacga gatgccacgg 360 

atctttttga aaatatcaat cattggcatt 420 

ctgtctggtg aagagtgttg ggaaaccctc 480 

atggattttg tgttctcttt agtcaattcc 540 

gggatgcaac tgatcacaag tcttggcctt 600 

gaactgatct atgcacaaac tctggtgtgg 660 

tttatccaaa tgattatgct tttcatcatg 720 

aatttccagc ctccgagcaa agcctggcgg 780 

ttgctctttt tcccatcttt caccggggtc 840 

ttgaagcctt cagctgactg tggccctttt 900 

tacagctgga tcgacaccct aagtacacgg 960 

cggaacctca ttggaagtgt gcacttcttt 1020 

acctatcttt actggcagat cacagaggga 1080 

cagatcatta atgagggcaa agataaaatg 1140 

gatatggaga agaaagcaaa ccccagctca 1200 

caaggctttt tgcatttggg ggaacatgat 1260 

gttcaagaag gtaatccaag ggcctgatga 1320 

aggggaggag atgaaaatgg aatgatttct 1380 

cagaagaaaa tccaaggctt tagccaggag 1440 

taaaattggg cattccatgc tatttttaat 1500 

acttggggtt ttccaataaa gattgttgta 1560 

agagataact agggaataat gtatattatc 1620 

cttagaaatc tctcctgcca gacttcccag 1680 

gaaaagtggt ccatcctgaa taaacatgta 17 40 

ttgtagaacc tgcatttatt tgtgactttg 1800 

ggaatacaac cagaggtctc atctctgaac 1860 

gagtcgggga tggaggaggt tctgcccctg 1920 

acgtcaagct 1960 



<400> 6 





He 


Phe 


Cys 


Trp 


Asp 


Phe Thr 


Val 


Thr 


His 


Glu 


Lys 


Ala 


Val 


Lys 


Leu 




1 








5 








10 










15 




55 


Lys 


Gin 


Lys 


Asn 
20 


Leu 


Ser Thr 


Glu 


He 
25 


Arg 


Glu 


Asn 


Leu 


Ser 
30 


Glu 


Leu 




Arg 


Gin 


Glu 


Asn 


Ser 


Lys Leu 


Thr 


Phe 


Asn 


Gin 


Leu 


Leu 


Thr Arg 


Phe 








35 








40 










45 










Ser 


Ala 


Tyr 


Met 


Val 


Ala Trp 


Val 


Val 


Ser 


Thr 


Gly 


Val- 


Ala 


^ Ile^ Ala 


60 




50 








55 










60 












Cys 


Cys 


Ala 


Ala 


Val 


Tyr Tyr 


Leu 


Ala 


Glu 


Tyr 


Asn 


Leu 


Glu 


Phe 


Leu 




65 










70 








75 










80 




Lys 


Thr 


His 


Ser 


Asn 


Pro Gly 


Ala 


Val 


Leu 


Leu 


Leu 


Pro 


Phe 


Val 


Val 










85 








90 










95 




65 


Ser 


Cys 


He 


Asn 


Leu 


Ala Val 


Pro 


Cys 


He 


Tyr 


Ser 


Met 


Phe 


Arg 


Leu 



100 105 110 



4 
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Val Glu Arg Tyr Glu Met Pro Arg His Glu Val Tyr Val Leu Leu He 

115 120 125 

Arg Asn He Phe Leu Lys He Ser He He Gly He Leu Cys Tyr Tyr 
130 135 140 

5 Trp Leu Asn Thr Val Ala Leu Ser Gly Glu Glu Cys Trp Glu Thr Leu 
145 150 155 160 

He Gly Gin Asp He Tyr Arg Leu Leu Leu Met Asp Phe Val Phe Ser 

165 170 175 

Leu Val Asn Ser Phe Leu Gly Glu Phe Leu Arg Arg He He Gly Met 
10 180 185 190 

Gin Leu He Thr Ser Leu Gly Leu Gin Glu Phe Asp He Ala Arg Asn 

195 200 205 

Val Leu Glu Leu He Tyr Ala Gin Thr Leu Val Trp He Gly He Phe 
210 215 220 

15 Phe Cys Pro Leu Leu Pro Phe He Gin Met He Met Leu Phe He Met 
225 230 235 240 

Phe Tyr Ser Lys Asn He Ser Leu Met Met Asn Phe Gin Pro Pro Ser 

245 250 255 

Lys Ala Trp Arg Ala Ser Gin Met Met Thr Phe Phe He Phe Leu Leu 
20 * 260 265 270 

Phe Phe Pro Ser Phe Thr Gly Val Leu Cys Thr Leu Ala He Thr He 

275 280 285 

Trp Arg Leu Lys Pro Ser Ala Asp Cys Gly Pro Phe Arg Gly Leu Pro 
290 295 300 

25 Leu Phe He His Ser He Tyr Ser Trp He Asp Thr Leu Ser Thr Arg 
305 310 315 320 

Pro Gly Tyr Leu Trp Val Val Trp He Tyr Arg Asn Leu He Gly Ser 

325 330 335 

Val His Phe Phe Phe He Leu Thr Leu He Val Leu He He Thr Tyr 
30 340 345 350 

Leu Tyr Trp Gin He Thr Glu Gly Arg Lys He Met He Arg Leu Leu 

355 360 365 

His Glu Gin He He Asn Glu Gly Lys Asp Lys Met Phe Leu He Glu 
370 375 380 

35 Lys Leu He Lys Leu Gin Asp Met Glu Lys Lys Ala Asn Pro Ser Ser 
385 390 395 400 

Leu Val Leu Glu Arg Arg Glu Val Glu Gin Glh Gly Phe Leu His Leu 

405 410 415 

Gly Glu His Asp Gly Ser Leu Asp Leu Arg Ser Arg Arg Ser Val Gin 
40 420 425 430 

Glu Gly Asn Pro Arg Ala 
435 

<210> 7 
45 <211> 1219 

<212> DNA 

<213> Artificial Sequence 
<400> 7 

50 ctgatgatga atttccagcc tccgagcaaa gcctggcggg cctcacagat gatgactttc 60 

ttcatcttct tgctcttttt cccatctttc accggggtct tgtgcaccct ggccatcacc 120 

atctggagat tgaagccttc agctgactgt ggcccttttc gaggtctgcc tctcttcatt 180 

cactccatct acagctggat cgacacccta agtacacggc ctggctacct gtgggttgtt 240 

tggatctatc ggaacctcat tggaagtgtg cacttctttt tcatcctcac cctcattgtg 300 

55 ctgatcatca cctatcttta ctggcagatc acagagggaa ggaagattat gataaggctg 360 

ctccatgagc agatcattaa tgagggcaaa gataaaatgt tcctgataga aaaattgatc 420 

aagctgcagg atatggagaa gaaagcaaac cccagctcac ttgttctgga aaggagagag 480 

gtggagcaac aaggcttttt gcatttgggg gaacatgatg gcagtcttga cttgcgatct 540 

agaagatcag ttcaagaagg taatccaagg gcctgatgac tcttttggta accagacacc 600 

60 aatcaaataa ggggaggaga tgaaaatgga atgatttctt ccatgccacc tgtgccttta 660 

ggaactgccc agaagaaaat ccaaggcttt agccaggagc ggaaactgac taccatgtaa 720 

ttatcaaagt aaaattgggc attccatgct atttttaata cctggattgc tgatttttca 7 80 

agacaaaata cttggggttt tccaataaag attgttgtaa tattgaaatg agcctacaaa 840 

aacctaggaa gagataacta gggaataatg tatattatct tcaagaaatg tgtgcaggaa 900 

65 tgattggttc ttagaaatct ctcctgccag acttcccaga cctggcaaag gtttagaaac 960 

tgttgctaag aaaagtggtc catcctgaat aaacatgtaa tactccagca gggatatgaa 1020 



5 
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45 



50 



gcctctgaat tgtagaacct gcatttattt gtgactttga actaaagaca tcccccatgt 1080 

cccaaaggtg gaatacaacc agaggtctca tctctgaact ttcttgcgta ctgattacat 1140 

gagtctttgg agtcggggat ggaggaggtt ctgcccctgt gaggtgttat acatgaccat 1200 

caaagtccta cgtcaagct 1219 

<210> 8 
<211> 191 
<212> PRT 

<213> Artificial Sequence 



10 



15 



20 



25 



30 



35 

<210> 9 
<211> 10 
<212> PRT 

40 <213> Artificial Sequence 

<400> 9 

lie Thr Glu Gly Arg Lys lie Met lie Arg 
15 10 





<400> 


8 






















Leu 


Met 


Met 


Asn 


Phe 


Gin 


Pro 


Pro 


Ser 


Lys 


Ala Trp Arg 


Ala 


Ser 


Gin 


1 








5 










10 






15 




Met 


Met 


Thr 


Phe 


Phe 


He 


Phe 


Leu 


Leu 


Phe 


Phe Pro Ser 


Phe 


Thr 


Gly 








20 










25 






30 






Val 


Leu 


Cys 


Thr 


Leu 


Ala 


He 


Thr 


He 


Trp 


Arg Leu Lys 


Pro 


Ser 


Ala 






35 










40 






45 








Asp 


Cys 


Gly 


Pro 


Phe 


Arg 


Gly 


Leu 


Pro 


Leu 


Phe He His 


Ser 


He 


Tyr 


50 










55 








60 








Ser 


Trp 


He 


Asp 


Thr 


Leu 


Ser 


Thr Arg 


Pro 


Gly Tyr Leu 


Trp 


Val 


Val 


65 








70 










75 






80 


Trp 


He 


Tyr 


Arg 


Asn 


Leu 


He 


Gly Ser 


Val 


His Phe Phe 


Phe 


He 


Leu 








85 










90 






95 




Thr 


Leu 


He 


Val 


Leu 


He 


He 


Thr 


Tyr 


Leu 


Tyr Trp Gin 


He 


Thr 


Glu 








100 










105 






110 






Gly 


Arg 


Lys 


He 


Met 


He 


Arg 


Leu 


Leu 


His 


Glu Gin He 


He 


Asn 


Glu 


115 










120 






125 








Gly 


Lys 


Asp 


Lys 


Met 


Phe 


Leu 


He 


Glu 


Lys 


Leu He Lys 


Leu 


Gin 


Asp 


130 










135 








140 








Met 


Glu 


Lys 


Lys 


Ala 


Asn 


Pro 


Ser 


Ser 


Leu 


Val Leu Glu 


Arg 


Arg 


Glu 


145 








150 










155 






160 


Val 


Glu 


Gin 


Gin 


Gly 


Phe 


Leu 


His 


Leu 


Gly 


Glu His Asp 


Gly 


Ser 


Leu 










165 










170 






175 




Asp 


Leu 


Arg 


Ser 


Arg 


Arg 


Ser 


Val 


Gin 


Glu 


Gly Asn Pro 


Arg 


Ala 








180 










185 






190 







<210> 10 

<211> 9 

<212> PRT 

<213> Artificial Sequence 

<400> 10 

Leu Leu Met Asp Phe Val Phe Ser Leu 



55 <210> 11 

<211> 9 

<212> PRT 

<213>, Artificial Sequence 

60 <400> 11 

Phe Leu Leu Phe Phe Pro Ser Phe Thr 
1 5 

<210> 12 

65 <211> 9 

<212> PRT 



6 



wo 01/23417 



PCT/EPOO/09500 



<213> Artificial Sequence 



<400> 12 

Gin Met Met Thr Phe Phe He Phe Leu 
5 1 5 

<210> 13 
<211> 9 
<212> PRT 

10 <213> Artificial Sequence 



15 



20 



<400> 13 

Met Met Thr Phe Phe He Phe Leu Leu 



<210> 14 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 14 

Phe Leu He Glu Lys Leu He Lys Leu 
1 5 



25 <210> 15 

<211> 9 
<212> PRT 

<213> Artificial Sequence 



30 <400> 15 

Val Leu Leu He Arg Asn He Phe Leu 
1 5 

<210> 16 
35 <211> 9 

<212> PRT 

<213> Artificial Sequence 
<400> 16 

40 Leu Val Trp He Gly He Phe Phe Cys 
1 5 

<210> 17 
<211> 9 
45 <212> PRT 

<213> Artificial Sequence 

<400> 17 

Thr Leu Ala He Thr He Trp Arg Leu 
50 1 5 



<210> 18 

<211> 9 

<212> PRT 

55 <213> Artificial Sequence 



<400> 18 

Leu He Phe Cys Trp Asp Phe Thr Val 
1 5 

60 

<210> 19 
<211> 9 
<212> PRT 

<213> Artificial Sequence 

65 

<400> 19 



7 



wo 01/23417 



PCT/EPOO/09500 



Phe Leu Gly Glu Phe Leu Arg Arg lie 
1 5 

<210> 20 

5 <211> 9 

<212> PRT 

<213> Artificial Sequence 

<400> 20 

10 Leu Leu Leu Pro Phe Val Val Ser Cys 
1 5 

<210> 21 
<211> 9 
15 <212> PRT 

<213> Artificial Sequence 

<400> 21 

Leu Leu Thr Arg Phe Ser Ala Tyr Met 
20 1 5 

<210> 22 
<211> 9 
<212> PRT 

25 <213> Artificial Sequence 

<400> 22 

Lys Leu He Phe Cys Trp Asp Phe Thr 
1 5 

30 

<210> 23 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

35 

<400> 23 

Leu Leu Leu Met Asp Phe Val Phe Ser Leu 
1 5 10 

40 <210> 24 

<211> 10 
<212> PRT 

<213> Artificial Sequence 

45 <400> 24 

Leu Leu Met Asp Phe Val Phe Ser Leu Val 
15 10 

<210> 25 
50 <211> 10 

<212> PRT 

<213> Artificial Sequence 
<400> 25 

55 Gin Met Met Thr Phe Phe He Phe Leu Leu 
1 5 ^10 

<210> 26 
<2il> 10 
60 <212> PRT 

<213> Artificial Sequence 

<400> 26 

Tyr Leu Ala Glu Tyr Asn Leu Glu Phe Leu 
65 1 5 10 



wo 01/23417 



PCT/EPOO/09500 



<210> 27 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

5 

<400> 21 

Lys Leu He Phe Cys Trp Asp Phe Thr Val 
1 5 10 

10 <210> 28 

<211> 10 
<212> PRT 

<213> Artificial Sequence 

15 <400> 28 

Leu Leu Phe Phe Pro Ser Phe Thr Gly Val 
15 10 

<210> 29 
20 <211> 10 

<212> PRT 

<213> Artificial Sequence 
<400> 29 

25 Gin Leu Leu Thr Arg Phe Ser Ala Tyr Met 
1 5 10 

<210> 30 
<211> 10 
30 <212> PRT 

<213> Artificial Sequence 

<400> 30 

Thr Leu Val Trp He Gly He Phe Phe Cys 
35 1 5 10 

<2lb> 31 
<211> 10 
<212> PRT 

40 <213> Artificial Sequence 

<400> 31 

Ser Gin Met Met Thr Phe Phe He Phe Leu 
1 5 10 



45 



50 



<210> 32 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 32 

Val Leu Leu Leu Pro Phe Val Val Ser Cys 
15 10 



55 <210> 33 

<211> 10 
<212> PRT 

<213> Artificial Sequence 

60 <400> 33 

He Leu Cys Tyr Tyr Trp Leu Asn Thr Val 
1 5 10 

. <210> 34 
65 <211> 10 

<212> PRT 



9 



wo 01/23417 



PCT/EPOO/09500 



<213> Artificial Sequence 
<400> 34 

Thr Leu He Val Leu He He Thr Tyr Leu 
5 1 5 10 

<210> 35 
<211> 10 
<212> PRT 

10 <213> Artificial Sequence 

<400> 33 

Tyr Val Leu Leu He Arg Asn He Phe Leu 
1 5 10 

15 

<210> 36 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

20 

<400> 36 

Phe Val Phe Ser Leu Val Asn Ser Phe Leu 
1 5 10 

25 <210> 37 

<211> 10 
<212> PRT 

<213> Artificial Sequence 

30 <400> 37 

Leu Leu Leu Pro Phe Val Val Ser Cys He 
1 5 10 

<210> 38 
35 <211> 10 

, <212> PRT 

<213> Artificial Sequence 

<400> 38 

40 Leu He Gly Ser Val His Phe Phe Phe He 
15 10 

<210> 39 
<211> 10 
45 <212> PRT 

<213> Artificial Sequence 

<400> 39 

Tyr Glu Met Pro Arg His Glu Val Tyr Val 
50 1 5 10 

<210> 40 
<211> 10 
<212> PRT 

55 <213> Artificial Sequence 

<400> 40 

Leu Leu Thr Arg Phe Ser Ala Tyr Met Val 
1 5 10 

60 

<210> 41 
<211> 9 
<212> PRT 

<213> Artificial Sequence 

65 

<400> 41 



10 



wo 01/23417 



PCT/EPOO/09500 



lie Tyr Ser Gly Gly He Thr Lys Leu 
1 b 

<210> 42 

5 <211> 9 

<212> PRT 

<213> Artificial Sequence 

<400> 42 

10 Val Tyr Tyr Leu Ala Glu Tyr Asn Leu 
1 5 

<210> 43 
<211> 9 
15 <212> PRT 

<213> Artificial Sequence 

<400> 43 

Tyr Tyr Trp Leu Asn Thr Val Ala Leu 
20 1 5 

<210> 44 
<211> 9 
<212> PRT 

25 <213> Artificial Sequence 

<400> 44 

He Tyr Arg Leu Leu Leu Met Asp Phe 
1 5 

30 

<210> 45 
<211> 10 
<212> PRT 

<213> Artificial Sequence 

35 

<400> 45 

Val Tyr Val Leu Leu He Arg Asn He Phe 
1 5 10 

40 <210> 46 

<211> 10 
<212> PRT 

<213> Artificial Sequence 

45 <400> 46 

Cys Tyr Tyr Trp Leu Asn Thr Val Ala Leu 
1 5 10 

<210> 47 
50 <211> 10 

<212> PRT 

<213> Artificial Sequence 
<400> 47 

55 Tyr Tyr Leu Ala Glu Tyr Asn Leu Glu Phe 
1 5 10 

<210> 48 

<211> 9 

60 <212> PRT 

<213> Artificial Sequence 

<400> 48 

Met Leu Phe He Met Phe Tyr Ser Lys 
65 1 5 



11 



wo 01/23417 



PCT/EPOO/09500 



<210> 49 

<211> 9 ^> 
<212> PRT 

<213> Artificial Sequence 

5 

<400> 49 

Leu Leu lie Arg Asn lie Phe Leu Lys 
1 5 

10 <210> 50 

<211> 9 
<212> PRT 

<213> Artificial Sequence 

15 <400> 50 

Gly Leu Gin Glu Phe Asp He Ala Arg 
1 5 

<210> 51 
20 <211> 10 

<212> PRT 

<213> Artificial Sequence 
<400> 51 

25 Lys Met Phe Leu He Glu Lys Leu He Lys 
15 10 

<210> 52 
<211> 10 
30 <212> PRT 

<213> Artificial Sequence 

<400> 52 

He Met Leu Phe He Met Phe Tyr Ser Lys 
35 1 5 10 

<210> 53 
<211> 10 
<212> PRT 

40 <213> Artificial Sequence 

<400> 53 

Lys Leu He Lys Leu Gin Asp Met Glu Lys 
15 10 



45 



50 



<210> 54 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<400> 54 

Val Leu Leu He Arg Asn He Phe Leu Lys 
1 5 10 



55 <210> 55 

<211> 9 
<212> PRT 

<213> Artificial Sequence 

60 <400> 55 

Met Pro Arg His Glu Val Tyr Val Leu 
1 5 

<210> 56 
65 <211> 10 

<212> PRT 



12 



wo 01/23417 



PCT/EPOO/09500 



<213> Artificial Sequence 

<400> 56 

Met Pro Arg His Glu Val Tyr Val Leu Leu 
5 1 5 10 

<210> 57 

<211> 9 

<212> PRT 

10 <213> Artificial Sequence 

<400> 57 

Tyr Glu Met Pro Arg His Glu Val Tyr 
1 5 



15 



20 



<210> 58 

<211> 10 

<212> PRT 

<213> Artificial Sequence 

<400> 50 

Trp Glu Thr Leu lie Gly Gin Asp He Tyr 
1 5 10 



25 <210> 59 

<211> 9 
<212> PRT 

<213> Artificial Sequence 

30 <400> 59 

Val His Phe Phe Phe He Leu Thr Leu 



<210> €0 
35 <211> 9 

<212> PRT 

<213> Artificial Sequence 
<400> 60 

40 Leu Arg Arg He He Gly Met Gin Leu 
1 5 

<210> 61 
<211> 9 
45 <212> PRT 

<213> Artificial Sequence 

<400> 61 

Leu Leu Met Asp Phe Val Phe Ser Leu 
50 1 5 

<210> 62 

<211> 9 

<212> PRT 

55 <213> Artificial Sequence 

<400> 62 

Met Gin Leu He Thr Ser Leu Gly Leu 
1 5 



60 



65 



<210> 63 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 63 



13 



wo 01/23417 



PCT/EPOO/09500 



Phe Val Phe Ser Leu Val Asn Ser Phe 
1 5 

<210> 64 
5 <211> 9 

<212> PRT 

<213> Artificial Sequence 
<400> 64 

10 Phe lie Leu Thr Leu lie Val Leu lie 
1 5 

<210> 65 
<211> 9 
15 <212> PRT 

<213> Artificial Sequence 

<400> 65 

Trp Gin He Thr Glu Gly Arg Lys lie 
20 1 5 

<210> 66 
<2ai> 9 
<212> PRT 

25 <213> Artificial Sequence 

<400> 66 

Met Phe Tyr Ser Lys Asn He Ser Leu 
1 5 



30 



35 



<210> 67 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 67 

Phe He Met Phe Tyr Ser Lys Asn He 
1 5 



40 <210> 68 

<211> 9 

<212> PRT 

<213> Artificial Sequence 

45 <400> 68 

Trp Arg Ala Ser Gin Met Met Thr Phe 
1 5 

<210> 69 

50 <2H> 9 

<212> PRT 

<213> Artificial Sequence 

<400> 69 

55 Phe Phe Phe He Leu Thr Leu He Val 
1 5 

<210> 70 

<211> 9 

60 <212> PRT 

<213> Artificial Sequence 

<400> 70 

Phe Arg Leu Val Glu Arg Tyr Glu Met 
65 1 5 



14 



wo 01/23417 



PCT/EPOO/09500 



<210> 71 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<400> 71 

Phe Phe He Leu Thr Leu He Val Leu 
1 5 

<210> 72 

<211> 9 

<212> PRT 

<213> Artificial Sequence 

<400> 72 

Tyr Ala Gin Thr Leu Val Trp He Ala 
1 5 



15 



